Diagnostic control compositions

ABSTRACT

Provided herein is a nucleotide cassette comprising an inducible promoter, a nucleotide sequence that corresponds to at least one single stranded RNA diagnostic target, a nucleotide sequence that encodes artemin, a molecular switch and a nucleotide sequence that encodes a DNAse enzyme and is under control of the molecular switch, wherein the single stranded RNA diagnostic target is a sequence detected by a molecular diagnostic assay. In some embodiments the nucleotide cassette can be used to obtain an RNA expression product. Also provided are vectors and cells comprising the nucleotide cassette or the RNA expression product thereof. The nucleotide cassette can further be used to obtain a diagnostic control composition comprising a non-pathogenic recombinant bacterium having a modified genetic content comprising the nucleotide cassette and to methods of producing such recombinant bacteria.

CROSS REFERENCE TO RELATED APPLICATIONS

The present application is a U.S. National Phase Application of PCT International Application No. PCT/M2021/053369, filed on Apr. 23, 2021 with the International Bureau, which is an International Application of and claims priority to GB Patent Application Serial No. 2005985.3, filed on Apr. 23, 2020. The contents of the above-referenced applications are herein expressly incorporated by reference in their entireties, including any drawings.

REFERENCE TO A SEQUENCE LISTING

The material in the accompanying Sequence Listing is hereby incorporated by reference into this application. The accompanying Sequence Listing text file, named “Sequence_Listing_057095-502N01US_ST25.TXT” was created on Oct. 21, 2022 and is 80,838 bytes.

BACKGROUND OF THE INVENTION

Testing for viral infections in many cases requires RNA-based positive controls for testing of instrument suitability, proficiency testing and external quality assurance systems. In a number of cases, the actual virus is used a control which creates numerous occupational health and safety issues and unnecessary exposure of laboratory staff to dangerous human pathogens. Most of these viral controls require specialist shipping and handling procedures and need highly skilled staff and infrastructure, which limits their use. Proficiency testing controls are central to the deployment of any successful diagnostic test and indeed the lack of these remains a significant hurdle to the use of new testing kits as they become available.

The newly emergent severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has resulted in a global pandemic prompting radical measures to contain the spread of the virus. Coronavirus disease 2019 (COVID-19) is the infectious disease caused by SARS-CoV-2. The disease was first identified in December 2019 in Wuhan, China, and has since spread globally, resulting in a global pandemic. Common symptoms of COVID-19 include fever, cough and shortness of breath. Other symptoms may include fatigue, muscle pain, diarrhea, sore throat, loss of smell and abdominal pain. The time from exposure to the onset of symptoms is typically five days but may range from two to fourteen days. The majority of cases result in relatively mild symptoms, however some patients progress to viral pneumonia and multi-organ failure.

The rapid spread of SARS-CoV-2 has created a global pandemic, resulting in a grim forecast for the global economy and public health programs of even the most well-resourced nations. Whilst most countries continue to battle against raging local epidemics, with the precise formula for rapidly achieving control remaining elusive, it has been demonstrated that smart, aggressive public health interventions can reduce the rate of new infections. Countries such as China and South Korea have been able to contain the spread of new infections through rapidly deploying testing for Covid-19 infection and disease. When these diagnostics were coupled with effective public health interventions such as isolation and quarantine, a significant flattening of the curve was achieved. Diagnostic testing can adopt one of two modalities, either directed at detecting the nucleic acid (or genetic content) of the virus, a method commonly referred to as RT-PCR, or through serological testing aimed at detecting antibodies against certain viral proteins. RT-PCR tests have the capacity for detecting the virus early in the infection, in individuals who are asymptomatic, and allows for reduction in spread of new infections. A major challenge in containing the spread of the virus has been identifying asymptomatic infections which are reported to be major drivers of the pandemic. As such the demand for RT-PCR tests has increased exponentially all over the world. However, as these become available the need for proficiency testing tools also becomes an urgent priority. Proficiency testing controls allow for an assessment of whether a test is fit for purpose and provides valuable information on the ability of laboratories to carry out testing in a quality assured manner. These controls usually encompass both positive and negative controls. In many cases the disease-causing agent is used as the positive control, for example the single stranded RNA virus. Using the live virus as a positive control for evaluating test performance in this case is not desirable as the corona virus has a rapid, explosive infection force and it's use in this case creates notable public health concerns.

The inventors have thus employed a biomimicry approach to engineer a positive control organism that mimics the genetic material of the SARS-CoV-2 virus and other single stranded RNA viruses, including human immunodeficiency virus (HIV) and hepatitis C virus (HCV), that is targeted by several RT-PCR based diagnostic kits to create a safe, non-infectious and stable positive control that can be rapidly deployed in any setting. The approach uses a Mycobacterium smegmatis cell, as an encapsulating casing, which contains a plasmid DNA molecule that has the viral diagnostic targets that mimic the diagnostic profile of the clinical pathogen. Because the genetic material that forms the basis of the molecular assay is RNA, the plasmid DNA further includes elements which allow for RNA expression, stabilisation of the RNA and degradation of the remaining DNA, thereby ensuring that the positive control is an effective control that mimics the diagnostic profile of the virus.

SUMMARY OF THE INVENTION

The present invention relates to a nucleotide cassette comprising a nucleotide sequence corresponding to at least one target nucleic acid from a single stranded RNA diagnostic target. This nucleotide sequence contains an inducible promoter, a nucleotide sequence that encodes artemin, a molecular switch and a nucleotide sequence that encodes a DNAse enzyme and is under control of the molecular switch, wherein the single stranded RNA diagnostic target is converted to an RNA sequence target that is detected by a molecular diagnostic assay. The invention also relates to an RNA expression product of the nucleotide cassette, a vector comprising the nucleotide cassette, and to a cell carrying the nucleotide cassette, the RNA expression product or the vector. In addition, this invention relates to a diagnostic control composition comprising a non-pathogenic recombinant bacterium having a modified genetic content comprising the nucleotide cassette, as well as to methods of producing these recombinant bacteria. The invention also relates to a kit comprising the cell, diagnostic control composition or recombinant bacterium produced according to the method.

According to a first aspect of the present invention there is provided for a nucleotide cassette having the formula: X₁-X₂-X₃-X₄-X₅ wherein, X₁ is an inducible promoter; X₂ is a nucleotide sequence corresponding to at least one single stranded RNA diagnostic target; X₃ is a nucleotide sequence that encodes artemin; X₄ is a molecular switch; and X₅ is a nucleotide sequence that encodes a DNAse enzyme and is under control of the molecular switch, wherein the single stranded RNA diagnostic target is a sequence detected by a molecular diagnostic assay. In one embodiment, the single stranded RNA diagnostic target is a single stranded RNA virus target, such as a SARS-CoV-2 diagnostic target detected by a molecular diagnostic assay that detects SARS-CoV-2.

In a first embodiment of the nucleotide cassette of the invention, the SARS-CoV-2 target may be selected from the group consisting of genes encoding RdRP1, RdRP2, N protein, E protein, Spike protein, ORF1ab, and combinations thereof. It will be appreciated by those of skill in the art that other regions of the SARS-CoV-2 virus may be contemplated if these are detected in molecular diagnostic tests. For example, the SARS-CoV-2 target may be any sequence in the SARS-CoV-2 genome, which is represented by SEQ ID NOs:18-28. In an alternative embodiment, the target may be one or more other single stranded RNA diagnostic targets that are detected by a molecular diagnostic assay. Non-limiting examples of such targets include single stranded virus targets detected by a diagnostic assay for HCV, HIV, respiratory syncytial virus, influenza virus and Ebola virus. It will be appreciated by those of skill in the art that any single stranded RNA diagnostic target of a molecular diagnostic assay could be used, including the BRCA-ABL human ss-mRNA.

In a second embodiment of the nucleotide cassette of the invention, X₂ is selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or a sequence substantially identical or complementary to any of SEQ ID NOs:2-7, and combinations thereof.

According to a third embodiment of the nucleotide cassette of the invention the inducible promoter is a heat inducible promoter, preferably Hsp60.

In a third embodiment of the nucleotide cassette of the invention the molecular switch may be a theophylline riboswitch that is activated via introduction and binding of theophylline, and which induces the production of DNAse enzyme.

In a further embodiment of the invention the nucleotide cassette comprises the sequence of SEQ ID NO:14 or SEQ ID NO:15, or a sequence substantially identical or complementary thereto.

According to a second aspect of the invention there is provided for an RNA expression product produced by the nucleotide cassette of the first aspect.

According to a third aspect of the present invention there is provided for a vector comprising the nucleotide cassette of the first aspect.

According to a fourth aspect of the present invention there is provided for a cell comprising the nucleotide cassette of the first aspect, the RNA expression product of the second aspect, or the vector of the third aspect.

In a second embodiment of the cell of the invention, the cell is a recombinant Mycobacterium smegmatis cell.

According to a fifth aspect of the present invention there is provided for a diagnostic control composition comprising a non-pathogenic recombinant bacterium having a modified genetic content comprising a nucleotide cassette having the formula: X₁-X₂-X₃-X₄-X₅ wherein, X₁ is an inducible promoter; X₂ is a nucleotide sequence corresponding to at least one single stranded RNA diagnostic target; X₃ is a nucleotide sequence that encodes artemin; X₄ is a molecular switch; and X₅ is a nucleotide sequence that encodes a DNAse enzyme and is under control of the molecular switch, wherein the single stranded RNA diagnostic target is a sequence detected by a molecular diagnostic assay and wherein the diagnostic control composition mimics the diagnostic profile of the single stranded RNA diagnostic target. In one embodiment, the single stranded RNA diagnostic target is a single stranded RNA virus target, such as a SARS-CoV-2 diagnostic target detected by a molecular diagnostic assay that detects SARS-CoV-2.

According to a first embodiment of the diagnostic control composition of the present invention the SARS-CoV-2 target may be selected from the group consisting of genes encoding RdRP1, RdRP2, N protein, E protein, Spike protein, ORF1ab, and combinations thereof. It will be appreciated by those of skill in the art that other regions of the SARS-CoV-2 virus may be contemplated by the present invention if these are to be detected in molecular diagnostic tests. For example, the SARS-CoV-2 target may be any sequence in the SARS-CoV-2 genome, which is represented by SEQ ID NOs:18-28. In an alternative embodiment, the target may be one or more other single stranded RNA diagnostic targets that are detected by a molecular diagnostic assay. Non-limiting examples of such targets include single stranded virus targets detected by a diagnostic assay for HCV, HIV, respiratory syncytial virus, influenza virus and Ebola virus. It will be appreciated by those of skill in the art that any single stranded RNA diagnostic target of a molecular diagnostic assay could be used, including the BRCA-ABL human ss-mRNA.

In a second embodiment of the diagnostic control composition, X₂ is selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or a sequence substantially identical or complementary to any of SEQ ID NOs:2-7, and combinations thereof.

According to a third embodiment of the diagnostic control composition of the invention the inducible promoter is a heat inducible promoter, preferably Hsp60.

In another embodiment of the diagnostic control composition, the molecular switch may be a theophylline riboswitch that is activated via introduction and binding of theophylline, and which induces the production of DNAse enzyme.

In a further embodiment of the diagnostic control composition of the invention the nucleotide cassette may comprise the sequence of SEQ ID NO:14 or SEQ ID NO:15, or a sequence substantially identical or complementary thereto.

In yet another embodiment of the diagnostic control composition of the present invention the non-pathogenic recombinant bacterium may be Mycobacterium smegmatis.

According to a further aspect of the present invention there is provided for a method of producing a recombinant bacterium that mimics the diagnostic profile of a single stranded RNA of interest in a molecular diagnostic assay, the method comprising:

(i) transforming a non-pathogenic bacterium with a vector comprising the nucleotide cassette described herein and a selection marker to obtain a recombinant bacterium; and

(ii) culturing the recombinant bacterium obtained in step (i) under selective conditions in order to select the recombinant bacterium.

In one embodiment of the method of the invention the non-pathogenic bacterium is Mycobacterium smegmatis.

In a further embodiment of the method of the invention the recombinant bacterium mimics the diagnostic profile of SARS-CoV-2. In an alternative embodiment, the recombinant bacterium mimics one or more other single stranded RNA viruses that are detected by a molecular diagnostic assay. IN yet a further, embodiment, the single stranded RNA of interest may be BRCA-ABL human ss-mRNA.

In another embodiment of the method of the present invention the selection marker may be an antibiotic selection marker.

In yet a further embodiment of the method of the invention the recombinant bacterium may be either stably or transiently transformed with the vector.

According to yet another aspect of the present invention there is provided for a kit comprising a cell of the invention, a diagnostic control composition of the invention, or the recombinant bacterium produced according to the method of the invention, and instructions for use.

BRIEF DESCRIPTION OF THE FIGURES

Non-limiting embodiments of the invention will now be described by way of example only and with reference to the following figures:

FIG. 1 : Plasmid map for covid-19 control. Mycobacterium smegmatis cells were transformed using a plasmid comprising elements 1 to 8 as shown. Numbers indicate the different genetic elements encoded by the plasmid: 1) promoter; 2) at least one CoV-2 target; 3) artemin; 4) DNAse; 5) plasmid backbone allowing for plasmid replication with a standard pUC Escherichia coli replicon; 6) kanamycin resistance; 7) mycobacterial origin of replication; and 8) theophylline riboswitch.

FIG. 2 : The SARS-CoV-2 DNA cassette for mimicking the diagnostic profile of the clinical pathogen.

FIG. 3 : RNA expression of the SARS-CoV-2 DNA cassette, the artemin riboswitch and DNAse genes, is induced by increasing the temperature as the elements in the cassette are under the control of a mycobacterium heat responsive promoter. This results in the expression of an RNA cassette comprising the SARS-CoV-2 target nucleic acids, artemin, the theophylline riboswitch and the DNAse. The artemin stabilises the SARS-CoV-2 targets nucleic acids. Thereafter, DNAse translation into protein is induced by the addition of theophylline and the DNA is degraded.

FIG. 4 : Nucleotide sequence of the expression cassette for targets RdRp1, RdRp2, N protein and E protein (SEQ ID NO:14).

FIG. 5 : Nucleotide sequence of the expression cassette for targets Spike protein and ORF1lab 5′ region (SEQ ID NO:15).

FIG. 6 : Nucleotide sequence of RNA expression product for targets RdRp1, RdRp2, N protein and E protein (SEQ ID NO:16).

FIG. 7 : Nucleotide sequence of RNA expression product for targets Spike protein and ORF1lab 5′ region (SEQ ID NO:17).

SEQUENCE LISTING

The nucleic acid sequences listed in the accompanying sequence listing are shown using standard letter abbreviations for nucleotide bases. It will be understood by those of skill in the art that only one strand of each nucleic acid sequence is shown, but that the complementary strand is included by any reference to the displayed strand. In the accompanying sequence listing:

-   SEQ ID NO:1—Nucleotide sequence of the Hsp60 promoter -   SEQ ID NO:2—Nucleotide sequence of SARS CoV-2 target RdRp1 -   SEQ ID NO:3—Nucleotide sequence of SARS CoV-2 target RdRp2 -   SEQ ID NO:4—Nucleotide sequence of SARS CoV-2 target N protein -   SEQ ID NO:5—Nucleotide sequence of SARS CoV-2 target E protein -   SEQ ID NO:6—Nucleotide sequence of SARS CoV-2 target Spike protein -   SEQ ID NO:7—Nucleotide sequence of SARS CoV-2 target ORF1lab 5′     region -   SEQ ID NO:8—Nucleotide sequence of artemin -   SEQ ID NO:9—Nucleotide sequence of DNAse -   SEQ ID NO:10—Nucleotide sequence of plasmid backbone -   SEQ ID NO:11—Nucleotide sequence of kanamycin resistance cassette -   SEQ ID NO:12—Nucleotide sequence of mycobacterial origin of     replication -   SEQ ID NO:13—Nucleotide sequence of theophylline riboswitch -   SEQ ID NO:14—Nucleotide sequence of the expression cassette for     targets RdRp1, RdRp2, N protein and E protein -   SEQ ID NO:15—Nucleotide sequence of the expression cassette for     targets Spike protein, ORF1lab 5′ region, RdRp1, RdRp2, N protein     and E protein -   SEQ ID NO:16—Nucleotide sequence of RNA expression product for     targets RdRp1, RdRp2, N protein and E protein -   SEQ ID NO:17—Nucleotide sequence of RNA expression product for     targets Spike protein, ORF1ab 5′ region, RdRp1, RdRp2, N protein and     E protein -   SEQ ID NO:18—Nucleotide Sequence of fragment 1 of SARS-CoV-2 genome -   SEQ ID NO:19—Nucleotide Sequence of fragment 2 of SARS-CoV-2 genome -   SEQ ID NO:20—Nucleotide Sequence of fragment 3 of SARS-CoV-2 genome -   SEQ ID NO:21—Nucleotide Sequence of fragment 4 of SARS-CoV-2 genome -   SEQ ID NO:22—Nucleotide Sequence of fragment 5 of SARS-CoV-2 genome -   SEQ ID NO:23—Nucleotide Sequence of fragment 6 of SARS-CoV-2 genome -   SEQ ID NO:24—Nucleotide Sequence of fragment 7 of SARS-CoV-2 genome -   SEQ ID NO:25—Nucleotide Sequence of fragment 8 of SARS-CoV-2 genome -   SEQ ID NO:26—Nucleotide Sequence of fragment 9 of SARS-CoV-2 genome -   SEQ ID NO:27—Nucleotide Sequence of fragment 10 of SARS-CoV-2 genome -   SEQ ID NO:28—Nucleotide Sequence of fragment 11 of SARS-CoV-2     genome.

DETAILED DESCRIPTION OF THE INVENTION

The present invention will now be described more fully hereinafter with reference to the accompanying drawings, in which some, but not all embodiments of the invention are shown.

The invention as described should not be limited to the specific embodiments disclosed and modifications and other embodiments are intended to be included within the scope of the invention. Although specific terms are employed herein, they are used in a generic and descriptive sense only and not for purposes of limitation.

As used throughout this specification and in the claims which follow, the singular forms “a”, “an” and “the” include the plural form, unless the context clearly indicates otherwise.

The terminology and phraseology used herein is for the purpose of description and should not be regarded as limiting. The use of the terms “comprising”, “containing”, “having” and “including” and variations thereof used herein, are meant to encompass the items listed thereafter and equivalents thereof as well as additional items.

The invention relates to a recombinant bacterium that mimics the diagnostic profile of one or more single stranded RNA diagnostic targets of interest that are detected by a molecular diagnostic assay, in particular a single stranded RNA virus, including SARS-CoV-2. The recombinant bacterium of the invention is based on a non-pathogenic bacterium that has a modified genetic content containing a target nucleic acid from a single stranded RNA diagnostic target, such as an ssRNA virus like SARS-CoV-2, wherein the target nucleic acid is detected by a molecular diagnostic assay. The approach of the inventors is to first culture the biomimetic Mycobacterium smegmatum comprising the DNA cassette to increase the biomass, then transfer to a high temperature to increase RNA expression of the single stranded RNA targets, and the artemin and DNAse genes. Once the artemin is expressed, it binds and stabilizes the RNA of the targets. Thereafter, theophylline is added to induce production of the DNAse, and the temperature is raised to activate the DNase. Once the DNAse is expressed it degrades the DNA, leaving only RNA in an encapsulated bacterial cell. This serves as a safe, non-infectious control organism for proficiency testing of molecular diagnostic assays for the single stranded RNA diagnostic targets, for example SARS-CoV-2 diagnostics. The process is outlined in FIG. 3 .

The recombinant bacterium of the present invention may be used as both an internal and external control for single stranded RNA diagnostic assays. Specifically, the recombinant Mycobacterium smegmatis of the present invention is used as an internal control for measuring efficacy of RNA extraction and to detect the presence of inhibitors of PCR, however, it is most importantly a positive control for the binding of the specific probes employed in the diagnostic assays, i.e. whether the test is fit for purpose, as well as of the handling of the sample by laboratory staff. In this way, the present invention represents an external control used to determine quality assurance of the testing program. Further, according to the present invention, the clinical sample is not spiked with the recombinant bacterium of the invention, rather the two samples are tested side-by-side.

The controls described herein focus on those required for test validation and proficiency testing and include validation controls, proficiency testing controls and external quality assessment (EQA) controls. Validation controls are required for two main reasons (i) when a test is conducted for the first time in a reference laboratory or a network of reference laboratories, positive and negative controls are required to determine if the test is fit-for-purpose and produces the correct result and (ii) as new testing kits emerge in the market, their utility can only be assessed by using controls that best mimic clinical specimens. Proficiency testing controls are useful as Covid-19 testing is rolled out to different reference labs, it becomes important to assess the proficiency of the lab testing system, including staff and infrastructure, in executing test procedures. For this, controlled specimens, where the expected outcome of the test result is known, are necessary. Finally, EQA controls are required in order to determine if laboratories are handling specimens correctly and executing test procedures appropriately. Again, it is important to develop controls that mimic clinical specimens.

In the case of viruses, a clinical specimen for the molecular diagnostic assay would comprise intact virus, for example SARS-CoV-2. As this poses numerous safety concerns, laboratory products that best mimic these specimens are more appropriate. For genetic testing, the viral RNA (inactivate) can serve as an appropriate control. Alternatively, plasmid DNA (naked or encapsulated) can serve as controls. In the case of serological tests, live viral particles, or suspensions that most closely mimic this are required.

The term “recombinant” means that something has been recombined. When used with reference to a nucleic acid construct, the term refers to a molecule that comprises nucleic acid sequences that are joined together or produced by means of molecular biological techniques. Recombinant nucleic acid constructs may include a nucleotide sequence which is ligated to, or is manipulated to become ligated to, a nucleic acid sequence to which it is not ligated in nature, or to which it is ligated at a different location in nature. Accordingly, a recombinant nucleic acid construct indicates that the nucleic acid molecule has been manipulated using genetic engineering, i.e. by human intervention. Recombinant nucleic acid constructs may be introduced into a host cell by transformation and preferably by transformation with a vector.

The term “vector” refers to a means by which polynucleotides or gene sequences can be introduced into a cell. There are various types of vectors known in the art including plasmids, viruses, bacteriophages and cosmids. Generally, polynucleotides or gene sequences are introduced into a vector by means of recombinant DNA technology.

The recombinant bacterium of the invention is preferably a recombinant Mycobacterium smegmatis bacterium. Mycobacterium smegmatis is preferred as this bacterium is non-pathogenic and does not require high levels of biosafety for its propagation. It will be appreciated that the recombinant bacterium of the invention preferably has a modified genetic content containing a target nucleic acid which is detected by a diagnostic assay. However, due to the incorporation of the target nucleic acid sequence of the single stranded RNA diagnostic target that is detected by the diagnostic assay, the recombinant Mycobacterium smegmatis mimics the diagnostic profile of the single stranded RNA diagnostic target, for example a virus, including SARS-CoV-2 pathogen.

As used herein the terms “nucleic acid”, “nucleic acid molecule” or “polynucleotide” encompass both ribonucleotides (RNA) and deoxyribonucleotides (DNA), including cDNA, genomic DNA, and synthetic DNA. A nucleic acid may be double-stranded or single-stranded. Where the nucleic acid is single-stranded, the nucleic acid may be the sense strand or the antisense strand. A nucleic acid molecule may be any chain of two or more covalently bonded nucleotides, including naturally occurring or non-naturally occurring nucleotides, or nucleotide analogs or derivatives. The term “DNA” refers to a sequence of two or more covalently bonded, naturally occurring or modified deoxyribonucleotides.

The recombinant Mycobacterium smegmatis as described herein may be used as a control for the detection of in a diagnostic assay for a single stranded RNA virus, such as SARS-CoV-2. Further, the recombinant bacterium may be used as a control for the detection of other RNA targets in a molecular diagnostic assay based on the detection using probes or amplification of single stranded RNA, such as the BRCA-ABL human ss-mRNA

Particularly, the recombinant Mycobacterium smegmatis may be used for the calibration of diagnostic devices used to diagnose a single stranded RNA virus, such as SARS-CoV-2, in a subject and for active surveillance monitoring of single stranded RNA viral infections, including SARS-CoV-2 infections. For example, the recombinant Mycobacterium smegmatis may be used as a control in the GeneXpert® system, Thermo Fisher, Cobas SARS-CoV-2, Abbot Realtime SARS-CoV-2, BGI, CDC, SeeGene, Tib Mol Bio and Ultragene ABL assays.

In one embodiment of the invention the recombinant Mycobacterium smegmatis is used for the purposes of external quality assessment (EQA) of the GeneXpert® modular cartridge system. The GeneXpert® system relies on the use of complementary nucleic acid probes to detect the presence or absence of the target single stranded RNA, for example a SARS-CoV-2 nucleic acid or even the BRCA-ABL human ss-mRNA. The present invention includes the same target nucleic acids as the GeneXpert® system using the same complementary nucleic acid probes.

As used herein, the term “mimics the diagnostic profile” means that the recombinant bacterium has the same diagnostic profile as the clinical sample containing the single stranded RNA, indistinguishably, and with the same specificity in an assay, when using the same detection probes or primers for both the clinical sample and the recombinant bacterium. This diagnostic profile that is mimicked may be, for example, a GeneXpert® assay readout.

As used herein the term “single stranded RNA diagnostic target” refers to a target region of single stranded RNA that is detected by a molecular diagnostic assay, in particular a nucleic acid assay that directly detects the presence of the RNA using either a probe or primer that specifically binds the region of single stranded RNA. Preferably the assay is an RT-PCR assay.

The term “complementary” refers to two nucleic acids molecules, e.g., DNA or RNA, which are capable of forming Watson-Crick base pairs to produce a region of double-strandedness between the two nucleic acid molecules. It will be appreciated by those of skill in the art that each nucleotide in a nucleic acid molecule need not form a matched Watson-Crick base pair with a nucleotide in an opposing complementary strand to form a duplex. One nucleic acid molecule is thus “complementary” to a second nucleic acid molecule if it hybridizes, under conditions of high stringency, with the second nucleic acid molecule. A nucleic acid molecule according to the invention includes both complementary molecules.

As used herein a “substantially identical” sequence is an amino acid or nucleotide sequence that differs from a reference sequence only by one or more conservative substitutions, or by one or more non-conservative substitutions, deletions, or insertions located at positions of the sequence that do not destroy or substantially reduce the binding of one or more of the probes for the targets in the molecular diagnostic assay. Alignment for purposes of determining percent sequence identity can be achieved in various ways that are within the knowledge of those with skill in the art. These include using, for instance, computer software such as ALIGN, Megalign (DNASTAR), CLUSTALW or BLAST software. Those skilled in the art can readily determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared. In one embodiment of the invention there is provided for a polynucleotide sequence that has at least about 80% sequence identity, at least about 90% sequence identity, or even greater sequence identity, such as about 95%, about 96%, about 97%, about 98% or about 99% sequence identity to the sequences described herein.

Alternatively, or additionally, two nucleic acid sequences may be “substantially identical” if they hybridize under high stringency conditions. The “stringency” of a hybridisation reaction is readily determinable by one of ordinary skill in the art, and generally is an empirical calculation which depends upon probe length, washing temperature, and salt concentration. In general, longer probes required higher temperatures for proper annealing, while shorter probes require lower temperatures. Hybridisation generally depends on the ability of denatured DNA to re-anneal when complementary strands are present in an environment below their melting temperature. A typical example of such “stringent” hybridisation conditions would be hybridisation carried out for 18 hours at 65° C. with gentle shaking, a first wash for 12 min at 65° C. in Wash Buffer A (0.5% SDS; 2×SSC), and a second wash for 10 min at 65° C. in Wash Buffer B (0.1% SDS; 0.5% SSC).

Herein, the inventors have generated strains of a non-pathogenic Mycobacterium smegmatis comprising a plasmid that includes target nucleic acids of diagnostic RT-PCR based tests inserted into the plasmid.

The following examples are offered by way of illustration and not by way of limitation.

Example 1 Bacterial Strains and Culture Conditions

All cloning was performed in Escherichia coli strain DH5α. Experiments were performed in Mycobacterium smegmatis strain mc²155. Nucleic acid sequence inserts used in the constructs are listed in Table 1. The plasmid backbone and the kanamycin resistance cassette were purchased as synthetic DNA fragments from GenScript™.

TABLE 1 Nucleic acid sequence inserts used Sequence Element Nucleic acid sequence Hsp60 promoter GTTAACACTAGTTGATTAGCTAAGCAGAAGGCCATCCTGACGGAT (SEQ ID NO: 1) GGCCTTTTTGCGTTTAATACTGTTTAAACCTCTAGAGGTGTGGTA GCCGATGCCGGTGTTGGCGCCGGTGACCACAACGACGCGCCCG CTTTGATCGGGGACGTCTGCGGCCGACCATTTACGGGTCTTGTT GTCGTTGGCGGTCATGGGCCGAACATACTCACCCGGATCGGAG GGCCGAGGACAAGGTCGAACGAGGGGCATGACCCGGTGCGGG GCTTCTTGCACTCGGCATAGGCGAGTGCTAAGAATAACGTTGGC ACTCGCGACCGGTGAGTGCTAGGTCGGGACGGTGAGGCCAGGC CCGTCGTCGCAGCGAGTGGCAGCGAGGACAACTTGAGCCGTCC GTCGCGGGCACTGCGCCCGGCCAGCGTAAGTAGCGGGGTTGCC GTCACCCGGTGACCCCCGTTTCATCCCCGATCCGCATGC SARS CoV-2 CTCTTATTGTAACAGCTTTAAGGGCCAATTCTGCTGTCAAATTACA target RdRp1 GAATAATGAGCTTAGTCCTGTTGCACTACGACAGATGTCTTGTGC (SEQ ID NO: 2) TGCCGGTACTACACAAACTGCTTGCACTGATGACAATGCGTTAGC TTACTACAACACAACAAAGGGAGGTAGGTTTGTACTTGCACTGTT ATCCGATTTACAGGATTTGAAATGGGCT SARS CoV-2 GAAATGCTGGTATTGTTGGTGTACTGACATTAGATAATCAAGATC target RdRp2 TCAATGGTAACTGGTATGATTTCGGTGATTTCATACAAACCACGC (SEQ ID NO: 3) CAGGTAGTGGAGTTCCTGTTGTAGATTCTTATTATTCATTGTTAAT GCCTATATTAACCTTGACCAGGGCTTTAACTGCAGAGTCACATGT TGACACTGACTTAACAAAGCCTTACA SARS CoV-2 TTTAGATTTCATCTAAACGAACAAACTAAAATGTCTGATAATGGAC target N protein CCCAAAATCAGCGAAATGCACCCCGCATTACGTTTGGTGGACCC (SEQ ID NO: 4) TCAGATTCAACTGGCAGTAACCAGAATGGAGAACGCAGTGGGGC GCGATCAAAACAACGTCGGCCCCAAGGTTTACCCAATAATACTGC GTCTTGGTTCACCGCTCTCACTCAACATGGCAAGGAAGACCTTAA ATTCCCTCGAGGACAAGGCGTTCCAATTAACACCAATAGCAGTCC AGATGACCAAATTGGCTACTACCGAAGAGCTACCAGACGAATTC GTGGTGGTGACGGTAAAATGAAAGATCTCAGTCCAAGATGGTATT TCTACTACCTAGGAACTGGGCCAGAAGCTGGACTTCCCTATGGT GCTAACAAAGACGGCATCATATGGGTTGCAACTGAGGGAGCCTT GAATACACCAAAAGATCACATTGGCACCCGCAATCCTGCTAACAA TGCTGCAATCGTGCTACAACTTCCTCAAGGAACAACATTGCCAAA AGGCTTCTACGCAGAAGGGAGCAGAGGCGGCAGTCAAGCCTCTT CTCGTTCCTCATCACGTAGTCGCAACAGTTCAAGAAATTCAACTC CAGGCAGCAGTAGGGGAACTTCTCCTGCTAGAATGGCTGGCAAT GGCGGTGATGCTGCTCTTGCTTTGCTGCTGCTTGACAGATTGAA CCAGCTTGAGAGCAAAATGTCTGGTAAAGGCCAACAACAACAAG GCCAAACTGTCACTAAGAAATCTGCTGCTGAGGCTTCTAAGAAGC CTCGGCAAAAACGTACTGCCACTAAAGCATACAATGTAACACAAG CTTTCGGCAGACGTGGTCCAGAACAAACCCAAGGAAATTTTGGG GACCAGGAACTAATCAGACAAGGAACTGATTACAAACATTGGCC GCAAATTGCACAATTTGCCCCCAGCGCTTCAGCGTTCTTCGGAAT GTCGCGCATTGGCATGGAAGTCACACCTTCGGGAACGTGGTTGA CCTACACAGGTGCCATCAAATTGGATGACAAAGATCCAAATTTCA AAGATCAAGTCATTTTGCTGAATAAGCATAT SARS CoV-2 CTCATTCGTTTCGGAAGAGACAGGTACGTTAATAGTTAATAGCGT target E protein ACTTCTTTTTCTTGCTTTCGTGGTATTCTTGCTAGTTACACTAGCC (SEQ ID NO: 5) ATCCTTACTGCGCTTCGATTGTGTGCGTACTGCTGCAATATTGTT AACGTGAGTCTTGTAAAACCTTCTTTTTACGTTTACTCTCGTGTTA AAAATCTGAATTCTTCTAGAGTTCCTGATCTTCTGGTCTAA SARS CoV-2 AAATTGATTGCCAACCAATTTAATAGTGCTATTGGCAAAATTCAAG target Spike ACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAGATG protein TGGTCAACCAAAATGCACAAGCTTTAAACACGCTTGTTAAACAAC (SEQ ID NO: 6) TTAGCTCCAATTTTGGTGCAATTTCAAGTGTTTTAAATGATATCCT TTCACGTCTTGACAAAGTTGAGGCTGAAGTGCAAATTGATAGGTT GATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTCAACA ATTAATTAGAGCTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGC TACTAAAATGT SARS CoV-2 ACTTGAACAGCCCTATGTGTTCATCAAACGTTCGGATGCTCGAAC target ORF1ab TGCACCTCATGGTCATGTTATGGTTGAGCTGGTAGCAGAACTCGA 5’ region AGGCATTCAGTACGGTCGTAGTGGTGAGACACTTGGTGTCCTTG (SEQ ID NO: 7) TCCCTCATGTGGGCGAAATACCAGTGGCTTACCGCAAGGTTCTT CTTCGTAAGAACGGTAATAAAGGAGCTGGTGGCCATAGTTACGG CGCCGATCTAAAGTCATTTGACTTAGGCGACGAGCTTGGCACTG ATCCTTATGAAGATTTTCAAGAAAACTGGAACACTAAACATAGCA GTGGTGTTACCCGTGAACTCATGCGTGAGCTTAACGGAGGGGCA TACACTCGCTATGTCGATAACAACTTCTGTGGCCCTGATGGCTAC Artem in AGGAAGGAATGTACATATGGCGACCGAGGGCGCGCGCAACATC (SEQ ID NO: 8) GGCCAGTCGGCCCCGGAGGGCAAGGTGCAGATGGACTGCCCGT CGCGCCACAACTTCGACCCGGAGTGCGAGAAGGCGTTTGTGGA GCACATCCACCTTGAGCTGGCCTCGTCGTACCACGCATGGTCGA TGTGGGCCTTCTACGCCCGCGACTGCAAGGCCGCCGTGGGCAT GACCCGCCTGTGCGAGTGGGCCTCGCACGTCTCGGCCCAGCGC GCCCGCCGCATGGCCGCCTACGTGCTGACCCGCGGCGGCCACG TGGACTACAAGGAGATCCCGGCCCCGAAGAAGCAGGGCTGGGA CAACTTCGAGGACGCCTTCTCGCACTGCGTGGCGAACAAGAAGC GCATCCTGACCTCGCTCCAGTCGCTGTACCAGTGCTGCCAGTCG AAGGACGCCCACTGCTCGAACTTCATCCAGACCGACATGATGGA CGAGGTGATCGCGTGGAACAAGTTTCTGTCGGACTGCCTGTCGA ACCTGCACTGCATCGGCTCGCAGGGCATGGGACCGTGGGTGTT CGACCGCTGGCTGGCCCGCATCGTGATGTCGAAGTTCAAGCACC CGAAGATCCCGTCGCTCTCGACCTCGGACCTAGAGTCGAACATC CCGAACGAGCTGTTCGACGCCGAGGGCGACATGGTGCGCGCCA TCAAGAAGCTGGACTACAAGGACCATGACGGTGACTATAAAGAT CACGATATAGATTACAAGGATGACGATGACAAGTGA DNAse ATGCATACCATCTACCGCATCGAGAAGAAGGAGAACTACGTGGT (SEQ ID NO: 9) GCTGGACAAGGGCTTCCTGCACGACCGCGAGCTGTCGTGGCAG GCTAAGGGCCTGCTGGCCTTCATGCTGTCGATGCCGAACGACTG GGTGTTCAACATGAAGGACCTCCAGAACCGCTCGAAGAACGGTC GCGACGCCACCTACCGCATTATGAAGGAGCTGATCGAGGCCGG CTACGTGACCCGCGTGGAGAACCGCGACGGCGGCAAGTTCGGC AAGGTGGAGTACGTGGTCCACGAGGTGAAGCAGTCGCCGCACA CCGAGTCGCCGGACACCGTGCCGCCCTGCACCGAGAACCCGTA CCCCGGCAACCCGTACCCCGGCAACCCGTACCCGGAGAACCCG CCGCTGCTGAACAACAACAACACCAACTACAAGAACACCAACAAC GACGACGACAACAAGGACCGCCCGAAGACCAACTCGCTGAACG CCTTTCGCTTCTACGAGGAGAACTTCCAGCCGACCCTGTCGTCG GTGGACATCGAGATCCTGAACTACTGGCTGGACCGCTTCCCGGA GGAGATCGTGCTGTGCGCCATGCGGAAGGCCCTAGAGCAGAAC GTGCGCTCGATCAAGTACATCGACCGCATCCTTGCCAACTGGGA GATGCAGAAGGTGCAGACCCTTGAGGACGTGGCCCGCCTCGAC CGCCAGTACGAGCTGGAGAAGCAGGCCCGCCAGAAGCGCGGCG GCGTGGTGAACGGCTCGGTCCACCAGCACCGCGGCTCGGACGG TCGCTCGACGAAGGAGGACGAGCGCATCTCGCACTACGAGCCG GGCAAGTGGGACGACGTGGACATCTCGCTGGACGGCCTGCTGC ACCATCACCACCATCACTGA Plasmid GATCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTT backbone GCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGC (SEQ ID NO: 10) GCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAG GCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAA GAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAA AGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGAC GAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCC GACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCC TCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTG TCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCA CGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCT GGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCC TTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGAC TTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGC GAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTA ACTACGGCTACACTAGAAGAACAGTATTTGGTATCTGCGCTCTGC TGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCG GCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGC AGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGA TCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTT AAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGA TCCTTTTAAATTAAAAATGAAGTTTTAAATCAAGCCCAATCTGAAT AATGTTACAACCAATTAACCAATTCTGA Kanamycin ATGAGCCATATTCAACGGGAAACGTCGAGGCCGCGATTAAATTC resistance gene CAACATGGATGCTGATTTATATGGGTATAAATGGGCTCGCGATAA (SEQ ID NO: 11) TGTCGGGCAATCAGGTGCGACAATCTATCGCTTGTATGGGAAGC CCGATGCGCCAGAGTTGTTTCTGAAACATGGCAAAGGTAGCGTT GCCAATGATGTTACAGATGAGATGGTCAGACTAAACTGGCTGAC GGAATTTATGCCTCTTCCGACCATCAAGCATTTTATCCGTACTCCT GATGATGCATGGTTACTCACCACTGCGATCCCCGGAAAAACAGC ATTCCAGGTATTAGAAGAATATCCTGATTCAGGTGAAAATATTGTT GATGCGCTGGCAGTGTTCCTGCGCCGGTTGCATTCGATTCCTGT TTGTAATTGTCCTTTTAACAGCGATCGCGTATTTCGTCTCGCTCA GGCGCAATCACGAATGAATAACGGTTTGGTTGATGCGAGTGATTT TGATGACGAGCGTAATGGCTGGCCTGTTGAACAAGTCTGGAAAG AAATGCATAAACTTTTGCCATTCTCACCGGATTCAGTCGTCACTC ATGGTGATTTCTCACTTGATAACCTTATTTTTGACGAGGGGAAATT AATAGGTTGTATTGATGTTGGACGAGTCGGAATCGCAGACCGATA CCAGGATCTTGCCATCCTATGGAACTGCCTCGGTGAGTTTTCTCC TTCATTACAGAAACGGCTTTTTCAAAAATATGGTATTGATAATCCT GATATGAATAAATTGCAGTTTCATTTGATGCTCGATGAGTTTTTCT AA Mycobacterial GGGGCCTGTAACGGCACAACGAACCGTGCAACGAGAGTGGCCA origin of CGGATGCCACCACAAGCACTACAACGGAGTTCGCCACGTACATC replication ACCACAACCACCGATTCTGGCGGTGAGCTCCACGATATTCAGCG (SEQ ID NO: 12) GAAATGGCTTGGTATCGACCAAGATTCGTAGAACCCCGTCTCGT CTGGCTGGTATTCAAAACGGACGCAACGAAACACGCAACGAGAC AGGCATGGCCCAAACCAGAAAACTAGCGTCTACCAGGACTTTTA CCTGTCCGACCCGTTGCAACGGAACCCCCCACGGAACCCCCGC GACACCCGCTCCCCAATTGCGTTAGAACAGCGGTGGATTGTCGG CTTCGTTGTGGGCCTTTTGAGCCGCTTCCTGTTCTGCCGCACGC TCTTTCCTCGCCCGATAGCCGAGTCGCTTAACGGTGTCCAGATG CAGCCCGAAATGTTTGGCCGTTTGCGGCCAAGAGTGGCCCTCGT CGTCGTGATAGGCGCGGATGCGTTCGCGGCGTGCAGCCTGCTC GGCGAGCCACTCGCTGCGTTCCTGCGCCACGAGCCGGACGACG TGGCGTTCGGATAGTCCGGTGATTCGAGCGCCTTCGGCGGCGG TCACGCGCCGCTTTTTGCGGACAGTCGGCTGCCGGTTGTAGCCG TCGCTGTAGCCGTCGCTGTAGCCGTCGCTCATAGCAATGCCTCC ATGGCTGACGCGGACTTTGCGCGCCGCGCAACTGTGCTCGCCG CCGTGCGCGCTGCTGCGCCCTTCCGCGAGATGGCCGACTGGCG CGCACTGAGTGTGGCCTCGTAGACCACGATCCCGTCCGCCCAAA TGCGCGACTTGGTTGTGATCCAACGCCAAATGCTGTTGGCGATG GCGCGGACCTCGCTGTCCGGTAGCGGTCCGGGACACACGTCGT TGCACGGGAATTCGGCGTTTCGCGCGTGGCACTCGGCATAGATC GCGCGGCCGAGTCCGTCCACGTTCCGGGTCGGCAGGTAGATCC GCATGAGGGCGGGACGATAGGCCCACAACCTGACGGAATCGAA CAGTGCGCAATTCCGCCCTAGCGGCGTCGGAGCCGCTTTGTACG TGGTCTGCTGACGCCAGCGCGGCGGTGGCATGTTCGCGCCGAG CTCGGCCTCGATGTGGCTGAGTGTGTAGAGATCTGAGTGGAGCC ATTCCGTTTCCCAGGCGATGTGGCCGGGGTTTTTGGTCATGAGG CCTGAGTAACTGCGGTCGCCGTCGACGGCGCGCCGAAGGCCTT CGGCGCACGCCGCCATGTATGCGAGCGGCTTACGCCGCGCGTA TTCGGTGCGTGGAACAGGGGCGTTGAGTGCCCACACTGCGTGT GCGTGGCCGTTGGCGCGATTGCCCACGATCGCGTTGGGCAGCG GATGGGACCCCCGGGCGCTGAGCGCTCGGAGCGCTGCGTCTGG ATGGTCTACGTCCACGACCAGCAGGTTTGCCAGCGCTGTTGGGT TCGCCTCGATGTACCGGCGGCCTAGGGCCGACGCGCGGCTTTG GCGGTAGATCCCCTCGAGCAGATCGTCGCTTGCCAGCGGCCAG TACGGCAGCCAGAGCTGCTCAAATTCGTCGGCGACGTGGCTCAC GCTTGGTAGTAGACCACGATTAATCACCGGTGTATGGTCCGACA CGAGCTCCAAGTCAGATATTTCGCTGAGGGGCCACCCCACAACT GCACACTCCCCCGCTCTCCCGTCGAGCCCTGATGATGAAACACC AGCGACAGCCGAGCACCCCCAACCACCTGTACCAACCAGGAGG AACACATGCGTCGTTTCGAGGACGTTTCCGGGCCGCTAAGAGCC GCTGTGGCGGCCGTACACGCCGCCTTAGACCCGTTAGACCCCCT GCCGCCTGAATGCGCGGGTACGAGCCACACAGCACCCGAACTT ACGGAGCTGGTGGGCTCACCTGGCTTTATGGCGTACGAATCGGC TGTGTGCGACCTGTTGGGCGAGGTGAGATACGCGCTACTCACGC TGGCAAGGGCGACACAGCCGCCCCACCGAGCCCGCACGGCCG CGCGCGGTGTCAACAACCGGGTGAGTCGTGCACACCAGCAGGT GTTCGAGGCTTGGCTCGAAGTGCAGGACATCGTGGCGAACGCC GCCCGATGAGCCGCGCCTTACGCTGGCTGCCAGCCGTTCGCGG GCTGGTTGGTGCAGCGCGTCGAGCGGTTAGAGGCCCTGCGGTG TTCCACCACCGCAGGCCTCGCCCTTTTTAAGGCTGAATTTGCTTG TCTCCGAATCCAACTGGCTTGTCCAAGGGTGTATCTACGCTTAGT CCAAAGTTCAAACGAGGGGATTACACATGACCAACTTCGATAACG TTCTCGGCTCGATCTG Theophylline GGTGATACCAGCATCGTCTTGATGCCCTGGCAGCACCCTGCTAA riboswitch GGAGGCAACAAG (SEQ ID NO: 13)

E. coli strains were grown at 37° C. in standard Luria Bertani (LB) or 2YT liquid medium or on solid medium (LA) supplemented with 50 μg/ml kanamycin (kan). Mycobacterium smegmatis strains were grown at 37° C. shaking in Middlebrook 7H9 liquid medium (Difco) supplemented with 0.085% NaCl, 0.2% glucose, 0.2% glycerol and 0.05% Tween80, or on Middlebrook 7H10 solid medium (Difco) supplemented with 0.085% NaCl, 0.2% glucose and 0.5% glycerol. Kanamycin was used at a concentration of 50 μg/mL kanamycin.

Cloning of Shuttle Plasmids for Integration into Mycobacterium Smegmatis

The following elements were cloned and prepared in E. coli to obtain the plasmid shown in FIG. 1 :

-   1. Hsp60 promoter (SEQ ID NO:1): A well characterised heat     responsive promoter in mycobacteria that allows for induction of     high levels of messenger RNA when the bacterium is exposed to heat     in the form of 42° C. -   2. SARS-CoV-2 targets: pAura1 comprising the RNA-dependent RNA     polymerases RdRp1 (SEQ ID NO:2) and RdRp2 (SEQ IDNO:3), the N     protein (SEQ ID NO:4) and the E protein (SEQ ID NO:5). These are the     regions of the viral genetic material that are targeted by     diagnostic machines and tests such as the GeneXpert®. -   3. Artemin (SEQ ID NO:8): An RNA binding protein that stabilizes the     RNA molecule. It binds RNA and prevents it from being degraded. The     sequence was obtained from the genome of Artemia salina (Taxonomy     ID: 85549). The inclusion of the artemin protein is to facilitate     stabilization of RNA at room temperature for prolonged periods of     time. This allows for easier distribution and use of the controls. -   4. DNAse (SEQ ID NO:9): DNAse derived from the thermophilic     Geobacillus bacteriophage GBSV1 (Taxonomy ID: 365048). This DNAse is     inactive at room temperature and at 37° C., the temperature at which     the biomimetic control composition is cultured. However, when the     temperature is shifted to temperatures above 55° C., with an optimum     of around 60° C., the DNAse becomes active and degrades DNA. This     temperature sensitivity for activity allows for condition-dependent     DNA degradation, which prevents interference by DNA. -   5. Plasmid backbone (SEQ ID NO:10): Contains the basic elements     required for maintaining the plasmid in E. coli for cloning steps,     including the pUC57 origin of replication. -   6. Kanamycin resistance cassette (Nptll) (SEQ ID NO:11): The     cassette carries the open reading frame for the kanamycin resistance     gene -   7. Mycobacterial origin of replication from pYUB12 (SEQ ID NO:12):     Cloned from pYUB12 as an EcoRv-Hpal fragment. -   8. Theophylline riboswitch (SEQ ID NO:13): This is a regulatory     element that allows for translation of the RNA for the DNAse into     protein. It facilitates tight regulation of the DNAse so that DNA     degradation that is not regulated can be toxic to the cell during     growth.

The resulting plasmid was electroporated into M. smegmatis mc²155 by standard laboratory methods. Briefly, electrocompetent mc²155 were prepared as follows: Cells were grown to log phase (OD₆₀₀0.5-0.9) and harvested by centrifugation (3 500 rpm, 10 min, 4° C.). The pelleted cells were then washed three times by gentle resuspension in 10 ml ice-cold 10% glycerol and cells pelleted by centrifugation between washes (3 500 rpm, 10 min, 4° C.). The cells were resuspended in an appropriate volume of ice-cold 10% glycerol and used immediately. For transformation, 400 μl of electro-competent cells were transferred to pre-chilled 0.2 cm electroporation cuvettes (Bio-Rad), together with plasmid DNA. The Gene PulserX cell (Bio-Rad) was used for electroporation set at 2.5 kV, 25 μF and 10000. Cells were immediately rescued with 1 ml of 2× TY for 3 hours or overnight at 37° C. with shaking at 100 rpm. The rescued cells were plated on Middlebrooks Medium 7H9 supplemented with glucose, NaCl and kanamycin (25 μg/mL) for maintenance of the episomal shuttle plasmid.

Cultures were grown overnight at 37° C. and once sufficiently dense (OD₆₀₀>2), the culture was transferred to 42° C. to induce expression of the SARS-CoV-2 targets, artemin and DNAse cassette (SEQ ID NO:16) and grown shaking for an additional 30 minutes. Theophylline was then added at a concentration of 2 mM for 2 hours at 37° C. The temperature was then raised to 60° C. for 30 min to induce the DNAse. Cultures were split into 22.5 ml aliquots, in 50 ml conical bottom centrifuge, and incubated at 80° C. for 70 minutes.

The cells were harvested by centrifugation at 3080×g for 10 minutes. Thereafter, the pellets were resuspended in 15 ml of 1×PBS solution with 0.05% Tween-80 and 32 ml SR buffer supplied by Cepheid® with GeneXpert® MTB/Ultra. Cells were incubated for 60 minutes at room temperature and the cells were harvested by centrifugation at 3080×g for 10 minutes. The pellets were resuspended in 45 ml of 1×PBS solution with 0.05% Tween-80. The cells were harvested by centrifugation at 3080×g for 10 minutes.

The pellets were resuspended in 15 ml of 1×PBS solution with 0.05% Tween-80. Clones were stored at 4° C. and used for analysis in standard GeneXpert® diagnostics in parallel with clinical samples.

Example 2 Analysis of Strains by Standard GeneXpert® Laboratory Diagnostics for SARS-CoV-2 Assay

Single colonies of modified Mycobacterium smegmatis were picked and resuspended in 500 μl of PBS and heated for 20 min at 60° C. Then, resulting liquid was added to the Xpert® Xpress SARS-CoV-2 cartridge. Three different colonies were picked and processed this way.

The digital output of the Xpert®-MTB/RIF tests was analysed and the results are shown in Table 2. All samples gave a test result of “SARS-CoV-2 POSITIVE”, indicating that the non-infectious biomimetic control cells mimic the diagnostic profile of SARS-CoV-2.

TABLE 2 Results from the Xpert ® SARS-CoV-2 assays Probe C SARS- Probe A Probe B (SPC/Internal CoV-2 (E protein) (N protein) Control) Sample Detected C_(t) End C_(t) End C_(t) End C12 YES 27.4 435 + 29.5 275 + 28.1 386 N/A BN2 YES 27.6 467 + 29.6 300 + 28.3 370 N/A C13 YES 23.2 440 + 25.1 294 + 28.1 435 N/A

Example 3 Development of SARS-CoV-2 Genome Library for Use as Nucleic Acid Sequence Inserts in the Diagnostic Control Constructs

In order to ensure that SARS-CoV-2 targets could span the entire SARS-CoV-2 genome, a library of the whole SARS-CoV-2 genome was constructed. The constructed genome includes 11 fragments as shown in Table 3 below.

Briefly, the genomic RNA was obtained by culture in a BSLIII facility, by tissue culture of the virus in the Chimpanzee Vero cell line. Eleven 3 kbp fragments were obtained by reverse transcription to generate cDNA product. PCR of the cDNA was performed to obtain dsDNA product, which was captured into E. coli replicating plasmid. To cover the entire 30 kbp genome, 11 fragments were designed to allow for overlap between the fragments, thereby ensuring coverage in case the region of interest should lie between fragments.

Any target of interest may be cloned from the library into M. smegmatis using the method described in Example 1 above.

TABLE 3 SARS-CoV-2 genome fragments. SEQ ID NO Nucleotide Sequence SEQ ATTAAAGGTTTATACCTTCCCAGGTAACAAACCAACCAACTTTCGATCTC ID TTGTAGATCTGTTCTCTAAACGAACTTTAAAATCTGTGTGGCTGTCACTCGGCTGCATGC NO: 18 TTAGTGCACTCACGCAGTATAATTAATAACTAATTACTGTCGTTGACAGGACACGAGTAA CTCGTCTATCTTCTGCAGGCTGCTTACGGTTTCGTCCGTGTTGCAGCCGATCATCAGCAC ATCTAGGTTTCGTCCGGGTGTGACCGAAAGGTAAGATGGAGAGCCTTGTCCCTGGTTTCA ACGAGAAAACACACGTCCAACTCAGTTTGCCTGTTTTACAGGTTCGCGACGTGCTCGTAC GTGGCTTTGGAGACTCCGTGGAGGAGGTCTTATCAGAGGCACGTCAACATCTTAAAGATG GCACTTGTGGCTTAGTAGAAGTTGAAAAAGGCGTTTTGCCTCAACTTGAACAGCCCTATG TGTTCATCAAACGTTCGGATGCTCGAACTGCACCTCATGGTCATGTTATGGTTGAGCTGG TAGCAGAACTCGAAGGCATTCAGTACGGTCGTAGTGGTGAGACACTTGGTGTCCTTGTCC CTCATGTGGGCGAAATACCAGTGGCTTACCGCAAGGTTCTTCTTCGTAAGAACGGTAATA AAGGAGCTGGTGGCCATAGTTACGGCGCCGATCTAAAGTCATTTGACTTAGGCGACGAGC TTGGCACTGATCCTTATGAAGATTTTCAAGAAAACTGGAACACTAAACATAGCAGTGGTG TTACCCGTGAACTCATGCGTGAGCTTAACGGAGGGGCATACACTCGCTATGTCGATAACA ACTTCTGTGGCCCTGATGGCTACCCTCTTGAGTGCATTAAAGACCTTCTAGCACGTGCTG GTAAAGCTTCATGCACTTTGTCCGAACAACTGGACTTTATTGACACTAAGAGGGGTGTAT ACTGCTGCCGTGAACATGAGCATGAAATTGCTTGGTACACGGAACGTTCTGAAAAGAGCT ATGAATTGCAGACACCTTTTGAAATTAAATTGGCAAAGAAATTTGACACCTTCAATGGGG AATGTCCAAATTTTGTATTTCCCTTAAATTCCATAATCAAGACTATTCAACCAAGGGTTG AAAAGAAAAAGCTTGATGGCTTTATGGGTAGAATTCGATCTGTCTATCCAGTTGCGTCAC CAAATGAATGCAACCAAATGTGCCTTTCAACTCTCATGAAGTGTGATCATTGTGGTGAAA CTTCATGGCAGACGGGCGATTTTGTTAAAGCCACTTGCGAATTTTGTGGCACTGAGAATT TGACTAAAGAAGGTGCCACTACTTGTGGTTACTTACCCCAAAATGCTGTTGTTAAAATTT ATTGTCCAGCATGTCACAATTCAGAAGTAGGACCTGAGCATAGTCTTGCCGAATACCATA ATGAATCTGGCTTGAAAACCATTCTTCGTAAGGGTGGTCGCACTATTGCCTTTGGAGGCT GTGTGTTCTCTTATGTTGGTTGCCATAACAAGTGTGCCTATTGGGTTCCACGTGCTAGCG CTAACATAGGTTGTAACCATACAGGTGTTGTTGGAGAAGGTTCCGAAGGTCTTAATGACA ACCTTCTTGAAATACTCCAAAAAGAGAAAGTCAACATCAATATTGTTGGTGACTTTAAAC TTAATGAAGAGATCGCCATTATTTTGGCATCTTTTTCTGCTTCCACAAGTGCTTTTGTGG AAACTGTGAAAGGTTTGGATTATAAAGCATTCAAACAAATTGTTGAATCCTGTGGTAATT TTAAAGTTACAAAAGGAAAAGCTAAAAAAGGTGCCTGGAATATTGGTGAACAGAAATCAA TACTGAGTCCTCTTTATGCATTTGCATCAGAGGCTGCTCGTGTTGTACGATCAATTTTCT CCCGCACTCTTGAAACTGCTCAAAATTCTGTGCGTGTTTTACAGAAGGCCGCTATAACAA TACTAGATGGAATTTCACAGTATTCACTGAGACTCATTGATGCTATGATGTTCACATCTG ATTTGGCTACTAACAATCTAGTTGTAATGGCCTACATTACAGGTGGTGTTGTTCAGTTGA CTTCGCAGTGGCTAACTAACATCTTTGGCACTGTTTATGAAAAACTCAAACCCGTCCTTG ATTGGCTTGAAGAGAAGTTTAAGGAAGGTGTAGAGTTTCTTAGAGACGGTTGGGAAATTG TTAAATTTATCTCAACCTGTGCTTGTGAAATTGTCGGTGGACAAATTGTCACCTGTGCAA AGGAAATTAAGGAGAGTGTTCAGACATTCTTTAAGCTTGTAAATAAATTTTTGGCTTTGT GTGCTGACTCTATCATTATTGGTGGAGCTAAACTTAAAGCCTTGAATTTAGGTGAAACAT TTGTCACGCACTCAAAGGGATTGTACAGAAAGTGTGTTAAATCCAGAGAAGAAACTGGCC TACTCATGCCTCTAAAAGCCCCAAAAGAAATTATCTTCTTAGAGGGAGAAACACTTCCCA CAGAAGTGTTAACAGAGGAAGTTGTCTTGAAAACTGGTGATTTACAACCATTAGAACAAC CTACTAGTGAAGCTGTTGAAGCTCCATTGGTTGGTACACCAGTTTGTATTAACGGGCTTA TGTTGCTCGAAATCAAAGACACAGAAAAGTACTGTGCCCTTGCACCTAATATGATGGTAA CAAACAATACCTTCACACTCAAAGGCGGTGCACCAACAAAGGTTACTTTTGGTGATGACA CTGTGATAGAAGTGCAAGGTTACAAGAGTGTGAATATCACTTTTGAACTTGATGAAAGGA TTGATAAAGTACTTAATGAGAAGTGCTCTGCCTATACAGTTGAACTCGGTACAGAAGTAA ATGAGTTCGCCTGTGTTGTGGCAGATGCTGTCATAAAAACTTTGCAACCAGTATCTGAAT TACTTACACCACTGGGCATTGATTTAGATGAGTGGAGTATGGCTACATACTACTTATTTG ATGAGTCTGGTGAGTTTAAATTGGCTTCACATATGTATTGTTCTTTCTACCCTCCAGATG AGGATGAAGAAGAAGGTGATTGTGAAGAAGAAGAGTTTGAGCCATCAACT SEQ CTTCACACTCAAAGGCGGTGCACCAACAAAGGTTACTTTTGGTGATGACA ID CTGTGATAGAAGTGCAAGGTTACAAGAGTGTGAATATCACTTTTGAACTTGATGAAAGGA NO: 19 TTGATAAAGTACTTAATGAGAAGTGCTCTGCCTATACAGTTGAACTCGGTACAGAAGTAA ATGAGTTCGCCTGTGTTGTGGCAGATGCTGTCATAAAAACTTTGCAACCAGTATCTGAAT TACTTACACCACTGGGCATTGATTTAGATGAGTGGAGTATGGCTACATACTACTTATTTG ATGAGTCTGGTGAGTTTAAATTGGCTTCACATATGTATTGTTCTTTCTACCCTCCAGATG AGGATGAAGAAGAAGGTGATTGTGAAGAAGAAGAGTTTGAGCCATCAACTCAATATGAGT ATGGTACTGAAGATGATTACCAAGGTAAACCTTTGGAATTTGGTGCCACTTCTGCTGCTC TTCAACCTGAAGAAGAGCAAGAAGAAGATTGGTTAGATGATGATAGTCAACAAACTGTTG GTCAACAAGACGGCAGTGAGGACAATCAGACAACTACTATTCAAACAATTGTTGAGGTTC AACCTCAATTAGAGATGGAACTTACACCAGTTGTTCAGACTATTGAAGTGAATAGTTTTA GTGGTTATTTAAAACTTACTGACAATGTATACATTAAAAATGCAGACATTGTGGAAGAAG CTAAAAAGGTAAAACCAACAGTGGTTGTTAATGCAGCCAATGTTTACCTTAAACATGGAG GAGGTGTTGCAGGAGCCTTAAATAAGGCTACTAACAATGCCATGCAAGTTGAATCTGATG ATTACATAGCTACTAATGGACCACTTAAAGTGGGTGGTAGTTGTGTTTTAAGCGGACACA ATCTTGCTAAACACTGTCTTCATGTTGTCGGCCCAAATGTTAACAAAGGTGAAGACATTC AACTTCTTAAGAGTGCTTATGAAAATTTTAATCAGCACGAAGTTCTACTTGCACCATTAT TATCAGCTGGTATTTTTGGTGCTGACCCTATACATTCTTTAAGAGTTTGTGTAGATACTG TTCGCACAAATGTCTACTTAGCTGTCTTTGATAAAAATCTCTATGACAAACTTGTTTCAA GCTTTTTGGAAATGAAGAGTGAAAAGCAAGTTGAACAAAAGATCGCTGAGATTCCTAAAG AGGAAGTTAAGCCATTTATAACTGAAAGTAAACCTTCAGTTGAACAGAGAAAACAAGATG ATAAGAAAATCAAAGCTTGTGTTGAAGAAGTTACAACAACTCTGGAAGAAACTAAGTTCC TCACAGAAAACTTGTTACTTTATATTGACATTAATGGCAATCTTCATCCAGATTCTGCCA CTCTTGTTAGTGACATTGACATCACTTTCTTAAAGAAAGATGCTCCATATATAGTGGGTG ATGTTGTTCAAGAGGGTGTTTTAACTGCTGTGGTTATACCTACTAAAAAGGCTGGTGGCA CTACTGAAATGCTAGCGAAAGCTTTGAGAAAAGTGCCAACAGACAATTATATAACCACTT ACCCGGGTCAGGGTTTAAATGGTTACACTGTAGAGGAGGCAAAGACAGTGCTTAAAAAGT GTAAAAGTGCCTTTTACATTCTACCATCTATTATCTCTAATGAGAAGCAAGAAATTCTTG GAACTGTTTCTTGGAATTTGCGAGAAATGCTTGCACATGCAGAAGAAACACGCAAATTAA TGCCTGTCTGTGTGGAAACTAAAGCCATAGTTTCAACTATACAGCGTAAATATAAGGGTA TTAAAATACAAGAGGGTGTGGTTGATTATGGTGCTAGATTTTACTTTTACACCAGTAAAA CAACTGTAGCGTCACTTATCAACACACTTAACGATCTAAATGAAACTCTTGTTACAATGC CACTTGGCTATGTAACACATGGCTTAAATTTGGAAGAAGCTGCTCGGTATATGAGATCTC TCAAAGTGCCAGCTACAGTTTCTGTTTCTTCACCTGATGCTGTTACAGCGTATAATGGTT ATCTTACTTCTTCTTCTAAAACACCTGAAGAACATTTTATTGAAACCATCTCACTTGCTG GTTCCTATAAAGATTGGTCCTATTCTGGACAATCTACACAACTAGGTATAGAATTTCTTA AGAGAGGTGATAAAAGTGTATATTACACTAGTAATCCTACCACATTCCACCTAGATGGTG AAGTTATCACCTTTGACAATCTTAAGACACTTCTTTCTTTGAGAGAAGTGAGGACTATTA AGGTGTTTACAACAGTAGACAACATTAACCTCCACACGCAAGTTGTGGACATGTCAATGA CATATGGACAACAGTTTGGTCCAACTTATTTGGATGGAGCTGATGTTACTAAAATAAAAC CTCATAATTCACATGAAGGTAAAACATTTTATGTTTTACCTAATGATGACACTCTACGTG TTGAGGCTTTTGAGTACTACCACACAACTGATCCTAGTTTTCTGGGTAGGTACATGTCAG CATTAAATCACACTAAAAAGTGGAAATACCCACAAGTTAATGGTTTAACTTCTATTAAAT GGGCAGATAACAACTGTTATCTTGCCACTGCATTGTTAACACTCCAACAAATAGAGTTGA AGTTTAATCCACCTGCTCTACAAGATGCTTATTACAGAGCAAGGGCTGGTGAAGCTGCTA ACTTTTGTGCACTTATCTTAGCCTACTGTAATAAGACAGTAGGTGAGTTAGGTGATGTTA GAGAAACAATGAGTTACTTGTTTCAACATGCCAATTTAGATTCTTGCAAAAGAGTCTTGA ACGTGGTGTGTAAAACTTGTGGACAACAGCAGACAACCCTTAAGGGTGTAGAAGCTGTTA TGTACATGGGCACACTTTCTTATGAACAATTTAAGAAAGGTGTTCAGATACCTTGTACGT GTGGTAAACAAGCTACAAAATATCTAGTACAACAGGAGTCACCTTTTGTTATGATGTCAG CACCACCTGCTCAGTATGAACTTAAGCATGGTACATTTACTTGTGCTAGTGAGTACACTG GTAATTACCAGTGTGGTCAC SEQ ACTTATCTTAGCCTACTGTAATAAGACAGTAGGTGAGTTAGGTGATGTTA ID GAGAAACAATGAGTTACTTGTTTCAACATGCCAATTTAGATTCTTGCAAAAGAGTCTTGA NO: 20 ACGTGGTGTGTAAAACTTGTGGACAACAGCAGACAACCCTTAAGGGTGTAGAAGCTGTTA TGTACATGGGCACACTTTCTTATGAACAATTTAAGAAAGGTGTTCAGATACCTTGTACGT GTGGTAAACAAGCTACAAAATATCTAGTACAACAGGAGTCACCTTTTGTTATGATGTCAG CACCACCTGCTCAGTATGAACTTAAGCATGGTACATTTACTTGTGCTAGTGAGTACACTG GTAATTACCAGTGTGGTCACTATAAACATATAACTTCTAAAGAAACTTTGTATTGCATAG ACGGTGCTTTACTTACAAAGTCCTCAGAATACAAAGGTCCTATTACGGATGTTTTCTACA AAGAAAACAGTTACACAACAACCATAAAACCAGTTACTTATAAATTGGATGGTGTTGTTT GTACAGAAATTGACCCTAAGTTGGACAATTATTATAAGAAAGACAATTCTTATTTCACAG AGCAACCAATTGATCTTGTACCAAACCAACCATATCCAAACGCAAGCTTCGATAATTTTA AGTTTGTATGTGATAATATCAAATTTGCTGATGATTTAAACCAGTTAACTGGTTATAAGA AACCTGCTTCAAGAGAGCTTAAAGTTACATTTTTCCCTGACTTAAATGGTGATGTGGTGG CTATTGATTATAAACACTACACACCCTCTTTTAAGAAAGGAGCTAAATTGTTACATAAAC CTATTGTTTGGCATGTTAACAATGCAACTAATAAAGCCACGTATAAACCAAATACCTGGT GTATACGTTGTCTTTGGAGCACAAAACCAGTTGAAACATCAAATTCGTTTGATGTACTGA AGTCAGAGGACGCGCAGGGAATGGATAATCTTGCCTGCGAAGATCTAAAACCAGTCTCTG AAGAAGTAGTGGAAAATCCTACCATACAGAAAGACGTTCTTGAGTGTAATGTGAAAACTA CCGAAGTTGTAGGAGACATTATACTTAAACCAGCAAATAATAGTTTAAAAATTACAGAAG AGGTTGGCCACACAGATCTAATGGCTGCTTATGTAGACAATTCTAGTCTTACTATTAAGA AACCTAATGAATTATCTAGAGTATTAGGTTTGAAAACCCTTGCTACTCATGGTTTAGCTG CTGTTAATAGTGTCCCTTGGGATACTATAGCTAATTATGCTAAGCCTTTTCTTAACAAAG TTGTTAGTACAACTACTAACATAGTTACACGGTGTTTAAACCGTGTTTGTACTAATTATA TGCCTTATTTCTTTACTTTATTGCTACAATTGTGTACTTTTACTAGAAGTACAAATTCTA GAATTAAAGCATCTATGCCGACTACTATAGCAAAGAATACTGTTAAGAGTGTCGGTAAAT TTTGTCTAGAGGCTTCATTTAATTATTTGAAGTCACCTAATTTTTCTAAACTGATAAATA TTATAATTTGGTTTTTACTATTAAGTGTTTGCCTAGGTTCTTTAATCTACTCAACCGCTG CTTTAGGTGTTTTAATGTCTAATTTAGGCATGCCTTCTTACTGTACTGGTTACAGAGAAG GCTATTTGAACTCTACTAATGTCACTATTGCAACCTACTGTACTGGTTCTATACCTTGTA GTGTTTGTCTTAGTGGTTTAGATTCTTTAGACACCTATCCTTCTTTAGAAACTATACAAA TTACCATTTCATCTTTTAAATGGGATTTAACTGCTTTTGGCTTAGTTGCAGAGTGGTTTT TGGCATATATTCTTTTCACTAGGTTTTTCTATGTACTTGGATTGGCTGCAATCATGCAAT TGTTTTTCAGCTATTTTGCAGTACATTTTATTAGTAATTCTTGGCTTATGTGGTTAATAA TTAATCTTGTACAAATGGCCCCGATTTCAGCTATGGTTAGAATGTACATCTTCTTTGCAT CATTTTATTATGTATGGAAAAGTTATGTGCATGTTGTAGACGGTTGTAATTCATCAACTT GTATGATGTGTTACAAACGTAATAGAGCAACAAGAGTCGAATGTACAACTATTGTTAATG GTGTTAGAAGGTCCTTTTATGTCTATGCTAATGGAGGTAAAGGCTTTTGCAAACTACACA ATTGGAATTGTGTTAATTGTGATACATTCTGTGCTGGTAGTACATTTATTAGTGATGAAG TTGCGAGAGACTTGTCACTACAGTTTAAAAGACCAATAAATCCTACTGACCAGTCTTCTT ACATCGTTGATAGTGTTACAGTGAAGAATGGTTCCATCCATCTTTACTTTGATAAAGCTG GTCAAAAGACTTATGAAAGACATTCTCTCTCTCATTTTGTTAACTTAGACAACCTGAGAG CTAATAACACTAAAGGTTCATTGCCTATTAATGTTATAGTTTTTGATGGTAAATCAAAAT GTGAAGAATCATCTGCAAAATCAGCGTCTGTTTACTACAGTCAGCTTATGTGTCAACCTA TACTGTTACTAGATCAGGCATTAGTGTCTGATGTTGGTGATAGTGCGGAAGTTGCAGTTA AAATGTTTGATGCTTACGTTAATACGTTTTCATCAACTTTTAACGTACCAATGGAAAAAC TCAAAACACTAGTTGCAACTGCAGAAGCTGAACTTGCAAAGAATGTGTCCTTAGACAATG TCTTATCTACTTTTATTTCAGCAGCTCGGCAAGGGTTTGTTGATTCAGATGTAGAAACTA AAGATGTTGTTGAATGTCTTAAATTGTCACATCAATCTGACATAGAAGTTACTGGCGATA GTTGTAATAACTATATGCTCACCTATAACAAAGTTGAAAACATGACACCCCGTGACCTTG GTGCTTGTATTGACTGTAGTGCGCGTCATATTAATGCGCAGGTAGCAAAAAGTCACAACA TTGCTTTGATATGGAACGTTAAAGATTTCATGTCATTGTCTGAACAACTACGAAAACAAA TACGTAGTGCTGCTAAAAAGAATAACTTACCTTTTAAGTTGACATGTGCAACTACTAGAC AAGTTGTTAATGTTGTAACAACAAAGATAGCACTTAAGGGTGGTAA SEQ AGTTGCAACTGCAGAAGCTGAACTTGCAAAGAATGTGTCCTTAGACAATG ID TCTTATCTACTTTTATTTCAGCAGCTCGGCAAGGGTTTGTTGATTCAGATGTAGAAACTA NO: 21 AAGATGTTGTTGAATGTCTTAAATTGTCACATCAATCTGACATAGAAGTTACTGGCGATA GTTGTAATAACTATATGCTCACCTATAACAAAGTTGAAAACATGACACCCCGTGACCTTG GTGCTTGTATTGACTGTAGTGCGCGTCATATTAATGCGCAGGTAGCAAAAAGTCACAACA TTGCTTTGATATGGAACGTTAAAGATTTCATGTCATTGTCTGAACAACTACGAAAACAAA TACGTAGTGCTGCTAAAAAGAATAACTTACCTTTTAAGTTGACATGTGCAACTACTAGAC AAGTTGTTAATGTTGTAACAACAAAGATAGCACTTAAGGGTGGTAAAATTGTTAATAATT GGTTGAAGCAGTTAATTAAAGTTACACTTGTGTTCCTTTTTGTTGCTGCTATTTTCTATT TAATAACACCTGTTCATGTCATGTCTAAACATACTGACTTTTCAAGTGAAATCATAGGAT ACAAGGCTATTGATGGTGGTGTCACTCGTGACATAGCATCTACAGATACTTGTTTTGCTA ACAAACATGCTGATTTTGACACATGGTTTAGCCAGCGTGGTGGTAGTTATACTAATGACA AAGCTTGCCCATTGATTGCTGCAGTCATAACAAGAGAAGTGGGTTTTGTCGTGCCTGGTT TGCCTGGCACGATATTACGCACAACTAATGGTGACTTTTTGCATTTCTTACCTAGAGTTT TTAGTGCAGTTGGTAACATCTGTTACACACCATCAAAACTTATAGAGTACACTGACTTTG CAACATCAGCTTGTGTTTTGGCTGCTGAATGTACAATTTTTAAAGATGCTTCTGGTAAGC CAGTACCATATTGTTATGATACCAATGTACTAGAAGGTTCTGTTGCTTATGAAAGTTTAC GCCCTGACACACGTTATGTGCTCATGGATGGCTCTATTATTCAATTTCCTAACACCTACC TTGAAGGTTCTGTTAGAGTGGTAACAACTTTTGATTCTGAGTACTGTAGGCACGGCACTT GTGAAAGATCAGAAGCTGGTGTTTGTGTATCTACTAGTGGTAGATGGGTACTTAACAATG ATTATTACAGATCTTTACCAGGAGTTTTCTGTGGTGTAGATGCTGTAAATTTACTTACTA ATATGTTTACACCACTAATTCAACCTATTGGTGCTTTGGACATATCAGCATCTATAGTAG CTGGTGGTATTGTAGCTATCGTAGTAACATGCCTTGCCTACTATTTTATGAGGTTTAGAA GAGCTTTTGGTGAATACAGTCATGTAGTTGCCTTTAATACTTTACTATTCCTTATGTCAT TCACTGTACTCTGTTTAACACCAGTTTACTCATTCTTACCTGGTGTTTATTCTGTTATTT ACTTGTACTTGACATTTTATCTTACTAATGATGTTTCTTTTTTAGCACATATTCAGTGGA TGGTTATGTTCACACCTTTAGTACCTTTCTGGATAACAATTGCTTATATCATTTGTATTT CCACAAAGCATTTCTATTGGTTCTTTAGTAATTACCTAAAGAGACGTGTAGTCTTTAATG GTGTTTCCTTTAGTACTTTTGAAGAAGCTGCGCTGTGCACCTTTTTGTTAAATAAAGAAA TGTATCTAAAGTTGCGTAGTGATGTGCTATTACCTCTTACGCAATATAATAGATACTTAG CTCTTTATAATAAGTACAAGTATTTTAGTGGAGCAATGGATACAACTAGCTACAGAGAAG CTGCTTGTTGTCATCTCGCAAAGGCTCTCAATGACTTCAGTAACTCAGGTTCTGATGTTC TTTACCAACCACCACAAACCTCTATCACCTCAGCTGTTTTGCAGAGTGGTTTTAGAAAAA TGGCATTCCCATCTGGTAAAGTTGAGGGTTGTATGGTACAAGTAACTTGTGGTACAACTA CACTTAACGGTCTTTGGCTTGATGACGTAGTTTACTGTCCAAGACATGTGATCTGCACCT CTGAAGACATGCTTAACCCTAATTATGAAGATTTACTCATTCGTAAGTCTAATCATAATT TCTTGGTACAGGCTGGTAATGTTCAACTCAGGGTTATTGGACATTCTATGCAAAATTGTG TACTTAAGCTTAAGGTTGATACAGCCAATCCTAAGACACCTAAGTATAAGTTTGTTCGCA TTCAACCAGGACAGACTTTTTCAGTGTTAGCTTGTTACAATGGTTCACCATCTGGTGTTT ACCAATGTGCTATGAGGCCCAATTTCACTATTAAGGGTTCATTCCTTAATGGTTCATGTG GTAGTGTTGGTTTTAACATAGATTATGACTGTGTCTCTTTTTGTTACATGCACCATATGG AATTACCAACTGGAGTTCATGCTGGCACAGACTTAGAAGGTAACTTTTATGGACCTTTTG TTGACAGGCAAACAGCACAAGCAGCTGGTACGGACACAACTATTACAGTTAATGTTTTAG CTTGGTTGTACGCTGCTGTTATAAATGGAGACAGGTGGTTTCTCAATCGATTTACCACAA CTCTTAATGACTTTAACCTTGTGGCTATGAAGTACAATTATGAACCTCTAACACAAGACC ATGTTGACATACTAGGACCTCTTTCTGCTCAAACTGGAATTGCCGTTTTAGATATGTGTG CTTCATTAAAAGAATTACTGCAAAATGGTATGAATGGACGTACCATATTGGGTAGTGCTT TATTAGAAGATGAATTTACACCTTTTGATGTTGTTAGACAATGCTCAGGTGTTACTTTCC AAAGTGCAGTGAAAAGAACAATCAAGGGTACACACCACTGGTTGTTACTCACAATTTTGA CTTCACTTTTAGTTTTAGTCCAGAGTACTCAATGGTCTTTGTTCTTTTTTTTGTATGAAA ATGCCTTTTTACCTTTTGCTATGGGTATTATTGCTATGTCTGCTTTTGCAATGATGTTTG TCAAACATAAGCATGCATTTCTCTGTTTGTTTTTGTTACCTTCTCTTGCCACTGT SEQ ACTAGGACCTCTTTCTGCTCAAACTGGAATTGCCGTTTTAGATATGTGTG ID CTTCATTAAAAGAATTACTGCAAAATGGTATGAATGGACGTACCATATTGGGTAGTGCTT NO: 22 TATTAGAAGATGAATTTACACCTTTTGATGTTGTTAGACAATGCTCAGGTGTTACTTTCC AAAGTGCAGTGAAAAGAACAATCAAGGGTACACACCACTGGTTGTTACTCACAATTTTGA CTTCACTTTTAGTTTTAGTCCAGAGTACTCAATGGTCTTTGTTCTTTTTTTTGTATGAAA ATGCCTTTTTACCTTTTGCTATGGGTATTATTGCTATGTCTGCTTTTGCAATGATGTTTG TCAAACATAAGCATGCATTTCTCTGTTTGTTTTTGTTACCTTCTCTTGCCACTGTAGCTT ATTTTAATATGGTCTATATGCCTGCTAGTTGGGTGATGCGTATTATGACATGGTTGGATA TGGTTGATACTAGTTTGTCTGGTTTTAAGCTAAAAGACTGTGTTATGTATGCATCAGCTG TAGTGTTACTAATCCTTATGACAGCAAGAACTGTGTATGATGATGGTGCTAGGAGAGTGT GGACACTTATGAATGTCTTGACACTCGTTTATAAAGTTTATTATGGTAATGCTTTAGATC AAGCCATTTCCATGTGGGCTCTTATAATCTCTGTTACTTCTAACTACTCAGGTGTAGTTA CAACTGTCATGTTTTTGGCCAGAGGTATTGTTTTTATGTGTGTTGAGTATTGCCCTATTT TCTTCATAACTGGTAATACACTTCAGTGTATAATGCTAGTTTATTGTTTCTTAGGCTATT TTTGTACTTGTTACTTTGGCCTCTTTTGTTTACTCAACCGCTACTTTAGACTGACTCTTG GTGTTTATGATTACTTAGTTTCTACACAGGAGTTTAGATATATGAATTCACAGGGACTAC TCCCACCCAAGAATAGCATAGATGCCTTCAAACTCAACATTAAATTGTTGGGTGTTGGTG GCAAACCTTGTATCAAAGTAGCCACTGTACAGTCTAAAATGTCAGATGTAAAGTGCACAT CAGTAGTCTTACTCTCAGTTTTGCAACAACTCAGAGTAGAATCATCATCTAAATTGTGGG CTCAATGTGTCCAGTTACACAATGACATTCTCTTAGCTAAAGATACTACTGAAGCCTTTG AAAAAATGGTTTCACTACTTTCTGTTTTGCTTTCCATGCAGGGTGCTGTAGACATAAACA AGCTTTGTGAAGAAATGCTGGACAACAGGGCAACCTTACAAGCTATAGCCTCAGAGTTTA GTTCCCTTCCATCATATGCAGCTTTTGCTACTGCTCAAGAAGCTTATGAGCAGGCTGTTG CTAATGGTGATTCTGAAGTTGTTCTTAAAAAGTTGAAGAAGTCTTTGAATGTGGCTAAAT CTGAATTTGACCGTGATGCAGCCATGCAACGTAAGTTGGAAAAGATGGCTGATCAAGCTA TGACCCAAATGTATAAACAGGCTAGATCTGAGGACAAGAGGGCAAAAGTTACTAGTGCTA TGCAGACAATGCTTTTCACTATGCTTAGAAAGTTGGATAATGATGCACTCAACAACATTA TCAACAATGCAAGAGATGGTTGTGTTCCCTTGAACATAATACCTCTTACAACAGCAGCCA AACTAATGGTTGTCATACCAGACTATAACACATATAAAAATACGTGTGATGGTACAACAT TTACTTATGCATCAGCATTGTGGGAAATCCAACAGGTTGTAGATGCAGATAGTAAAATTG TTCAACTTAGTGAAATTAGTATGGACAATTCACCTAATTTAGCATGGCCTCTTATTGTAA CAGCTTTAAGGGCCAATTCTGCTGTCAAATTACAGAATAATGAGCTTAGTCCTGTTGCAC TACGACAGATGTCTTGTGCTGCCGGTACTACACAAACTGCTTGCACTGATGACAATGCGT TAGCTTACTACAACACAACAAAGGGAGGTAGGTTTGTACTTGCACTGTTATCCGATTTAC AGGATTTGAAATGGGCTAGATTCCCTAAGAGTGATGGAACTGGTACTATCTATACAGAAC TGGAACCACCTTGTAGGTTTGTTACAGACACACCTAAAGGTCCTAAAGTGAAGTATTTAT ACTTTATTAAAGGATTAAACAACCTAAATAGAGGTATGGTACTTGGTAGTTTAGCTGCCA CAGTACGTCTACAAGCTGGTAATGCAACAGAAGTGCCTGCCAATTCAACTGTATTATCTT TCTGTGCTTTTGCTGTAGATGCTGCTAAAGCTTACAAAGATTATCTAGCTAGTGGGGGAC AACCAATCACTAATTGTGTTAAGATGTTGTGTACACACACTGGTACTGGTCAGGCAATAA CAGTTACACCGGAAGCCAATATGGATCAAGAATCCTTTGGTGGTGCATCGTGTTGTCTGT ACTGCCGTTGCCACATAGATCATCCAAATCCTAAAGGATTTTGTGACTTAAAAGGTAAGT ATGTACAAATACCTACAACTTGTGCTAATGACCCTGTGGGTTTTACACTTAAAAACACAG TCTGTACCGTCTGCGGTATGTGGAAAGGTTATGGCTGTAGTTGTGATCAACTCCGCGAAC CCATGCTTCAGTCAGCTGATGCACAATCGTTTTTAAACGGGTTTGCGGTGTAAGTGCAGC CCGTCTTACACCGTGCGGCACAGGCACTAGTACTGATGTCGTATACAGGGCTTTTGACAT CTACAATGATAAAGTAGCTGGTTTTGCTAAATTCCTAAAAACTAATTGTTGTCGCTTCCA AGAAAAGGACGAAGATGACAATTTAATTGATTCTTACTTTGTAGTTAAGAGACACACTTT CTCTAACTACCAACATGAAGAAACAATTTATAATTTACTTAAGGATTGTCCAGCTGTTGC TAAACATGACTTCTTTAAGTTTAGAATAGACGGTGACATGGTACCACATATATCACGTCA ACGTCTTACTAAATACACAATGGCAGACCTCGTCTATGCTTTAAGGCATTTTGATGAAGG TAATTGTGACACATTAAAAGAAATACTTGTCACATACAATTGTTGTGATGA SEQ CCGTGCGGCACAGGCACTAGTACTGATGTCGTATACAGGGCTTTTGACAT ID CTACAATGATAAAGTAGCTGGTTTTGCTAAATTCCTAAAAACTAATTGTTGTCGCTTCCA NO: 23 AGAAAAGGACGAAGATGACAATTTAATTGATTCTTACTTTGTAGTTAAGAGACACACTTT CTCTAACTACCAACATGAAGAAACAATTTATAATTTACTTAAGGATTGTCCAGCTGTTGC TAAACATGACTTCTTTAAGTTTAGAATAGACGGTGACATGGTACCACATATATCACGTCA ACGTCTTACTAAATACACAATGGCAGACCTCGTCTATGCTTTAAGGCATTTTGATGAAGG TAATTGTGACACATTAAAAGAAATACTTGTCACATACAATTGTTGTGATGATGATTATTT CAATAAAAAGGACTGGTATGATTTTGTAGAAAACCCAGATATATTACGCGTATACGCCAA CTTAGGTGAACGTGTACGCCAAGCTTTGTTAAAAACAGTACAATTCTGTGATGCCATGCG AAATGCTGGTATTGTTGGTGTACTGACATTAGATAATCAAGATCTCAATGGTAACTGGTA TGATTTCGGTGATTTCATACAAACCACGCCAGGTAGTGGAGTTCCTGTTGTAGATTCTTA TTATTCATTGTTAATGCCTATATTAACCTTGACCAGGGCTTTAACTGCAGAGTCACATGT TGACACTGACTTAACAAAGCCTTACATTAAGTGGGATTTGTTAAAATATGACTTCACGGA AGAGAGGTTAAAACTCTTTGACCGTTATTTTAAATATTGGGATCAGACATACCACCCAAA TTGTGTTAACTGTTTGGATGACAGATGCATTCTGCATTGTGCAAACTTTAATGTTTTATT CTCTACAGTGTTCCCACCTACAAGTTTTGGACCACTAGTGAGAAAAATATTTGTTGATGG TGTTCCATTTGTAGTTTCAACTGGATACCACTTCAGAGAGCTAGGTGTTGTACATAATCA GGATGTAAACTTACATAGCTCTAGACTTAGTTTTAAGGAATTACTTGTGTATGCTGCTGA CCCTGCTATGCACGCTGCTTCTGGTAATCTATTACTAGATAAACGCACTACGTGCTTTTC AGTAGCTGCACTTACTAACAATGTTGCTTTTCAAACTGTCAAACCCGGTAATTTTAACAA AGACTTCTATGACTTTGCTGTGTCTAAGGGTTTCTTTAAGGAAGGAAGTTCTGTTGAATT AAAACACTTCTTCTTTGCTCAGGATGGTAATGCTGCTATCAGCGATTATGACTACTATCG TTATAATCTACCAACAATGTGTGATATCAGACAACTACTATTTGTAGTTGAAGTTGTTGA TAAGTACTTTGATTGTTACGATGGTGGCTGTATTAATGCTAACCAAGTCATCGTCAACAA CCTAGACAAATCAGCTGGTTTTCCATTTAATAAATGGGGTAAGGCTAGACTTTATTATGA TTCAATGAGTTATGAGGATCAAGATGCACTTTTCGCATATACAAAACGTAATGTCATCCC TACTATAACTCAAATGAATCTTAAGTATGCCATTAGTGCAAAGAATAGAGCTCGCACCGT AGCTGGTGTCTCTATCTGTAGTACTATGACCAATAGACAGTTTCATCAAAAATTATTGAA ATCAATAGCCGCCACTAGAGGAGCTACTGTAGTAATTGGAACAAGCAAATTCTATGGTGG TTGGCACAACATGTTAAAAACTGTTTATAGTGATGTAGAAAACCCTCACCTTATGGGTTG GGATTATCCTAAATGTGATAGAGCCATGCCTAACATGCTTAGAATTATGGCCTCACTTGT TCTTGCTCGCAAACATACAACGTGTTGTAGCTTGTCACACCGTTTCTATAGATTAGCTAA TGAGTGTGCTCAAGTATTGAGTGAAATGGTCATGTGTGGCGGTTCACTATATGTTAAACC AGGTGGAACCTCATCAGGAGATGCCACAACTGCTTATGCTAATAGTGTTTTTAACATTTG TCAAGCTGTCACGGCCAATGTTAATGCACTTTTATCTACTGATGGTAACAAAATTGCCGA TAAGTATGTCCGCAATTTACAACACAGACTTTATGAGTGTCTCTATAGAAATAGAGATGT TGACACAGACTTTGTGAATGAGTTTTACGCATATTTGCGTAAACATTTCTCAATGATGAT ACTCTCTGACGATGCTGTTGTGTGTTTCAATAGCACTTATGCATCTCAAGGTCTAGTGGC TAGCATAAAGAACTTTAAGTCAGTTCTTTATTATCAAAACAATGTTTTTATGTCTGAAGC AAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCATGAATTTTGCTCTCAACATAC AATGCTAGTTAAACAGGGTGATGATTATGTGTACCTTCCTTACCCAGATCCATCAAGAAT CCTAGGGGCCGGCTGTTTTGTAGATGATATCGTAAAAACAGATGGTACACTTATGATTGA ACGGTTCGTGTCTTTAGCTATAGATGCTTACCCACTTACTAAACATCCTAATCAGGAGTA TGCTGATGTCTTTCATTTGTACTTACAATACATAAGAAAGCTACATGATGAGTTAACAGG ACACATGTTAGACATGTATTCTGTTATGCTTACTAATGATAACACTTCAAGGTATTGGGA ACCTGAGTTTTATGAGGCTATGTACACACCGCATACAGTCTTACAGGCTGTTGGGGCTTG TGTTCTTTGCAATTCACAGACTTCATTAAGATGTGGTGCTTGCATACGTAGACCATTCTT ATGTTGTAAATGCTGTTACGACCATGTCATATCAACATCACATAAATTAGTCTTGTCTGT TAATCCGTATGTTTGCAATGCTCCAGGTTGTGATGTCACAGATGTGACTC SEQ TATGAGGCTATGTACACACCGCATACAGTCTTACAGGCTGTTGGGGCTTG ID TGTTCTTTGCAATTCACAGACTTCATTAAGATGTGGTGCTTGCATACGTAGACCATTCTT NO: 24 ATGTTGTAAATGCTGTTACGACCATGTCATATCAACATCACATAAATTAGTCTTGTCTGT TAATCCGTATGTTTGCAATGCTCCAGGTTGTGATGTCACAGATGTGACTCAACTTTACTT AGGAGGTATGAGCTATTATTGTAAATCACATAAACCACCCATTAGTTTTCCATTGTGTGC TAATGGACAAGTTTTTGGTTTATATAAAAATACATGTGTTGGTAGCGATAATGTTACTGA CTTTAATGCAATTGCAACATGTGACTGGACAAATGCTGGTGATTACATTTTAGCTAACAC CTGTACTGAAAGACTCAAGCTTTTTGCAGCAGAAACGCTCAAAGCTACTGAGGAGACATT TAAACTGTCTTATGGTATTGCTACTGTACGTGAAGTGCTGTCTGACAGAGAATTACATCT TTCATGGGAAGTTGGTAAACCTAGACCACCACTTAACCGAAATTATGTCTTTACTGGTTA TCGTGTAACTAAAAACAGTAAAGTACAAATAGGAGAGTACACCTTTGAAAAAGGTGACTA TGGTGATGCTGTTGTTTACCGAGGTACAACAACTTACAAATTAAATGTTGGTGATTATTT TGTGCTGACATCACATACAGTAATGCCATTAAGTGCACCTACACTAGTGCCACAAGAGCA CTATGTTAGAATTACTGGCTTATACCCAACACTCAATATCTCAGATGAGTTTTCTAGCAA TGTTGCAAATTATCAAAAGGTTGGTATGCAAAAGTATTCTACACTCCAGGGACCACCTGG TACTGGTAAGAGTCATTTTGCTATTGGCCTAGCTCTCTACTACCCTTCTGCTCGCATAGT GTATACAGCTTGCTCTCATGCCGCTGTTGATGCACTATGTGAGAAGGCATTAAAATATTT GCCTATAGATAAATGTAGTAGAATTATACCTGCACGTGCTCGTGTAGAGTGTTTTGATAA ATTCAAAGTGAATTCAACATTAGAACAGTATGTCTTTTGTACTGTAAATGCATTGCCTGA GACGACAGCAGATATAGTTGTCTTTGATGAAATTTCAATGGCCACAAATTATGATTTGAG TGTTGTCAATGCCAGATTACGTGCTAAGCACTATGTGTACATTGGCGACCCTGCTCAATT ACCTGCACCACGCACATTGCTAACTAAGGGCACACTAGAACCAGAATATTTCAATTCAGT GTGTAGACTTATGAAAACTATAGGTCCAGACATGTTCCTCGGAACTTGTCGGCGTTGTCC TGCTGAAATTGTTGACACTGTGAGTGCTTTGGTTTATGATAATAAGCTTAAAGCACATAA AGACAAATCAGCTCAATGCTTTAAAATGTTTTATAAGGGTGTTATCACGCATGATGTTTC ATCTGCAATTAACAGGCCACAAATAGGCGTGGTAAGAGAATTCCTTACACGTAACCCTGC TTGGAGAAAAGCTGTCTTTATTTCACCTTATAATTCACAGAATGCTGTAGCCTCAAAGAT TTTGGGACTACCAACTCAAACTGTTGATTCATCACAGGGCTCAGAATATGACTATGTCAT ATTCACTCAAACCACTGAAACAGCTCACTCTTGTAATGTAAACAGATTTAATGTTGCTAT TACCAGAGCAAAAGTAGGCATACTTTGCATAATGTCTGATAGAGACCTTTATGACAAGTT GCAATTTACAAGTCTTGAAATTCCACGTAGGAATGTGGCAACTTTACAAGCTGAAAATGT AACAGGACTCTTTAAAGATTGTAGTAAGGTAATCACTGGGTTACATCCTACACAGGCACC TACACACCTCAGTGTTGACACTAAATTCAAAACTGAAGGTTTATGTGTTGACATACCTGG CATACCTAAGGACATGACCTATAGAAGACTCATCTCTATGATGGGTTTTAAAATGAATTA TCAAGTTAATGGTTACCCTAACATGTTTATCACCCGCGAAGAAGCTATAAGACATGTACG TGCATGGATTGGCTTCGATGTCGAGGGGTGTCATGCTACTAGAGAAGCTGTTGGTACCAA TTTACCTTTACAGCTAGGTTTTTCTACAGGTGTTAACCTAGTTGCTGTACCTACAGGTTA TGTTGATACACCTAATAATACAGATTTTTCCAGAGTTAGTGCTAAACCACCGCCTGGAGA TCAATTTAAACACCTCATACCACTTATGTACAAAGGACTTCCTTGGAATGTAGTGCGTAT AAAGATTGTACAAATGTTAAGTGACACACTTAAAAATCTCTCTGACAGAGTCGTATTTGT CTTATGGGCACATGGCTTTGAGTTGACATCTATGAAGTATTTTGTGAAAATAGGACCTGA GCGCACCTGTTGTCTATGTGATAGACGTGCCACATGCTTTTCCACTGCTTCAGACACTTA TGCCTGTTGGCATCATTCTATTGGATTTGATTACGTCTATAATCCGTTTATGATTGATGT TCAACAATGGGGTTTTACAGGTAACCTACAAAGCAACCATGATCTGTATTGTCAAGTCCA TGGTAATGCACATGTAGCTAGTTGTGATGCAATCATGACTAGGTGTCTAGCTGTCCACGA GTGCTTTGTTAAGCGTGTTGACTGGACTATTGAATATCCTATAATTGGTGATGAACTGAA GATTAATGCGGCTTGTAGAAAGGTTCAACACATGGTTGTTAAAGCTGCATTATTAGCAGA CAAATTCCCAGTTCTTCACGACATTGGTAACCCTAAAGCTATTAAGTGTGTACCTCAAGC TGATGTAGAATGGAAGTTCTATGATGCACAGCCTTGTAGTGACAAAGCTTATAAAATAGA AGAATTATTCTATTCTTATGCCACACATTCTGACAAATTCACAGATGGTGTATGCCTATT TTGGAATTGCAATGTCGATAGATATCCTGCTAATTCCATTGTTTGTAGATTTGACACTAG AGTGCTATCTAACCTTAACTTGCCTGGTTGTGATGGTGGCAGTTTGTATG SEQ AAGCGTGTTGACTGGACTATTGAATATCCTATAATTGGTGATGAACTGAA ID GATTAATGCGGCTTGTAGAAAGGTTCAACACATGGTTGTTAAAGCTGCATTATTAGCAGA NO: 25 CAAATTCCCAGTTCTTCACGACATTGGTAACCCTAAAGCTATTAAGTGTGTACCTCAAGC TGATGTAGAATGGAAGTTCTATGATGCACAGCCTTGTAGTGACAAAGCTTATAAAATAGA AGAATTATTCTATTCTTATGCCACACATTCTGACAAATTCACAGATGGTGTATGCCTATT TTGGAATTGCAATGTCGATAGATATCCTGCTAATTCCATTGTTTGTAGATTTGACACTAG AGTGCTATCTAACCTTAACTTGCCTGGTTGTGATGGTGGCAGTTTGTATGTAAATAAACA TGCATTCCACACACCAGCTTTTGATAAAAGTGCTTTTGTTAATTTAAAACAATTACCATT TTTCTATTACTCTGACAGTCCATGTGAGTCTCATGGAAAACAAGTAGTGTCAGATATAGA TTATGTACCACTAAAGTCTGCTACGTGTATAACACGTTGCAATTTAGGTGGTGCTGTCTG TAGACATCATGCTAATGAGTACAGATTGTATCTCGATGCTTATAACATGATGATCTCAGC TGGCTTTAGCTTGTGGGTTTACAAACAATTTGATACTTATAACCTCTGGAACACTTTTAC AAGACTTCAGAGTTTAGAAAATGTGGCTTTTAATGTTGTAAATAAGGGACACTTTGATGG ACAACAGGGTGAAGTACCAGTTTCTATCATTAATAACACTGTTTACACAAAAGTTGATGG TGTTGATGTAGAATTGTTTGAAAATAAAACAACATTACCTGTTAATGTAGCATTTGAGCT TTGGGCTAAGCGCAACATTAAACCAGTACCAGAGGTGAAAATACTCAATAATTTGGGTGT GGACATTGCTGCTAATACTGTGATCTGGGACTACAAAAGAGATGCTCCAGCACATATATC TACTATTGGTGTTTGTTCTATGACTGACATAGCCAAGAAACCAACTGAAACGATTTGTGC ACCACTCACTGTCTTTTTTGATGGTAGAGTTGATGGTCAAGTAGACTTATTTAGAAATGC CCGTAATGGTGTTCTTATTACAGAAGGTAGTGTTAAAGGTTTACAACCATCTGTAGGTCC CAAACAAGCTAGTCTTAATGGAGTCACATTAATTGGAGAAGCCGTAAAAACACAGTTCAA TTATTATAAGAAAGTTGATGGTGTTGTCCAACAATTACCTGAAACTTACTTTACTCAGAG TAGAAATTTACAAGAATTTAAACCCAGGAGTCAAATGGAAATTGATTTCTTAGAATTAGC TATGGATGAATTCATTGAACGGTATAAATTAGAAGGCTATGCCTTCGAACATATCGTTTA TGGAGATTTTAGTCATAGTCAGTTAGGTGGTTTACATCTACTGATTGGACTAGCTAAACG TTTTAAGGAATCACCTTTTGAATTAGAAGATTTTATTCCTATGGACAGTACAGTTAAAAA CTATTTCATAACAGATGCGCAAACAGGTTCATCTAAGTGTGTGTGTTCTGTTATTGATTT ATTACTTGATGATTTTGTTGAAATAATAAAATCCCAAGATTTATCTGTAGTTTCTAAGGT TGTCAAAGTGACTATTGACTATACAGAAATTTCATTTATGCTTTGGTGTAAAGATGGCCA TGTAGAAACATTTTACCCAAAATTACAATCTAGTCAAGCGTGGCAACCGGGTGTTGCTAT GCCTAATCTTTACAAAATGCAAAGAATGCTATTAGAAAAGTGTGACCTTCAAAATTATGG TGATAGTGCAACATTACCTAAAGGCATAATGATGAATGTCGCAAAATATACTCAACTGTG TCAATATTTAAACACATTAACATTAGCTGTACCCTATAATATGAGAGTTATACATTTTGG TGCTGGTTCTGATAAAGGAGTTGCACCAGGTACAGCTGTTTTAAGACAGTGGTTGCCTAC GGGTACGCTGCTTGTCGATTCAGATCTTAATGACTTTGTCTCTGATGCAGATTCAACTTT GATTGGTGATTGTGCAACTGTACATACAGCTAATAAATGGGATCTCATTATTAGTGATAT GTACGACCCTAAGACTAAAAATGTTACAAAAGAAAATGACTCTAAAGAGGGTTTTTTCAC TTACATTTGTGGGTTTATACAACAAAAGCTAGCTCTTGGAGGTTCCGTGGCTATAAAGAT AACAGAACATTCTTGGAATGCTGATCTTTATAAGCTCATGGGACACTTCGCATGGTGGAC AGCCTTTGTTACTAATGTGAATGCGTCATCATCTGAAGCATTTTTAATTGGATGTAATTA TCTTGGCAAACCACGCGAACAAATAGATGGTTATGTCATGCATGCAAATTACATATTTTG GAGGAATACAAATCCAATTCAGTTGTCTTCCTATTCTTTATTTGACATGAGTAAATTTCC CCTTAAATTAAGGGGTACTGCTGTTATGTCTTTAAAAGAAGGTCAAATCAATGATATGAT TTTATCTCTTCTTAGTAAAGGTAGACTTATAATTAGAGAAAACAACAGAGTTGTTATTTC TAGTGATGTTCTTGTTAACAACTAAACGAACAATGTTTGTTTTTCTTGTTTTATTGCCAC TAGTCTCTAGTCAGTGTGTTAATCTTACAACCAGAACTCAATTACCCCCTGCATACACTA ATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCAGTTTTACATT CAACTCAGGACTTGTTCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGCTATACATG TCTCTGGGACCAATGGTACTAAGAGGTTTGATAACCCTGTCCTACCATTTAATGATGGTG TTTATTTTGCTTCCACTGAGAAGTCTAACATAATAAGAGGCTGGATTTTTGGTACTACTT TAGATTCGAAGACCCAGTCCCTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAG TCTGTGAATTTCAATTTTGTAATGATCCATTTTTGGGTGTTTATTACCACAAAAACAACA AAAGTTGGATGGAAAGTGAGTTCAGAGTTTATTCTAG SEQ TCAGTGTGTTAATCTTACAACCAGAACTCAATTACCCCCTGCATACACTA ID ATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCAGTlliACATT NO: 26 CAACTCAGGACTTGTTCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGCTATACATG TCTCTGGGACCAATGGTACTAAGAGGTTTGATAACCCTGTCCTACCATTTAATGATGGTG TTTATTTTGCTTCCACTGAGAAGTCTAACATAATAAGAGGCTGGATTTTTGGTACTACTT TAGATTCGAAGACCCAGTCCCTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAG TCTGTGAATTTCAATTTTGTAATGATCCATTTTTGGGTGTTTATTACCACAAAAACAACA AAAGTTGGATGGAAAGTGAGTTCAGAGTTTATTCTAGTGCGAATAATTGCACTTTTGAAT ATGTCTCTCAGCCTTTTCTTATGGACCTTGAAGGAAAACAGGGTAATTTCAAAAATCTTA GGGAATTTGTGTTTAAGAATATTGATGGTTATTTTAAAATATATTCTAAGCACACGCCTA TTAATTTAGTGCGTGATCTCCCTCAGGGTTTTTCGGCTTTAGAACCATTGGTAGATTTGC CAATAGGTATTAACATCACTAGGTTTCAAACTTTACTTGCTTTACATAGAAGTTATTTGA CTCCTGGTGATTCTTCTTCAGGTTGGACAGCTGGTGCTGCAGCTTATTATGTGGGTTATC TTCAACCTAGGACTTTTCTATTAAAATATAATGAAAATGGAACCATTACAGATGCTGTAG ACTGTGCACTTGACCCTCTCTCAGAAACAAAGTGTACGTTGAAATCCTTCACTGTAGAAA AAGGAATCTATCAAACTTCTAACTTTAGAGTCCAACCAACAGAATCTATTGTTAGATTTC CTAATATTACAAACTTGTGCCCTTTTGGTGAAGTTTTTAACGCCACCAGATTTGCATCTG TTTATGCTTGGAACAGGAAGAGAATCAGCAACTGTGTTGCTGATTATTCTGTCCTATATA ATTCCGCATCATTTTCCACTTTTAAGTGTTATGGAGTGTCTCCTACTAAATTAAATGATC TCTGCTTTACTAATGTCTATGCAGATTCATTTGTAATTAGAGGTGATGAAGTCAGACAAA TCGCTCCAGGGCAAACTGGAAAGATTGCTGATTATAATTATAAATTACCAGATGATTlTA CAGGCTGCGTTATAGCTTGGAATTCTAACAATCTTGATTCTAAGGTTGGTGGTAATTATA ATTACCTGTATAGATTGTTTAGGAAGTCTAATCTCAAACCTTTTGAGAGAGATATTTCAA CTGAAATCTATCAGGCCGGTAGCACACCTTGTAATGGTGTTGAAGGTTTTAATTGTTACT TTCCTTTACAATCATATGGTTTCCAACCCACTAATGGTGTTGGTTACCAACCATACAGAG TAGTAGTACTTTCTTTTGAACTTCTACATGCACCAGCAACTGTTTGTGGACCTAAAAAGT CTACTAATTTGGTTAAAAACAAATGTGTCAATTTCAACTTCAATGGTTTAACAGGCACAG GTGTTCTTACTGAGTCTAACAAAAAGTTTCTGCCTTTCCAACAATTTGGCAGAGACATTG CTGACACTACTGATGCTGTCCGTGATCCACAGACACTTGAGATTCTTGACATTACACCAT GTTCTTTTGGTGGTGTCAGTGTTATAACACCAGGAACAAATACTTCTAACCAGGTTGCTG TTCTTTATCAGGATGTTAACTGCACAGAAGTCCCTGTTGCTATTCATGCAGATCAACTTA CTCCTACTTGGCGTGTTTATTCTACAGGTTCTAATGTTTTTCAAACACGTGCAGGCTGTT TAATAGGGGCTGAACATGTCAACAACTCATATGAGTGTGACATACCCATTGGTGCAGGTA TATGCGCTAGTTATCAGACTCAGACTAATTCTCCTCGGCGGGCACGTAGTGTAGCTAGTC AATCCATCATTGCCTACACTATGTCACTTGGTGCAGAAAATTCAGTTGCTTACTCTAATA ACTCTATTGCCATACCCACAAATTTTACTATTAGTGTTACCACAGAAATTCTACCAGTGT CTATGACCAAGACATCAGTAGATTGTACAATGTACATTTGTGGTGATTCAACTGAATGCA GCAATCTTTTGTTGGAATATGGGAGTTTTTGTACACAATTAAACCGTGCTTTAACTGGAA TAGCTGTTGAACAAGACAAAAACACCCAAGAAGTTTTTGCACAAGTCAAACAAATTTACA AAACACCACCAATTAAAGATTTTGGTGGTTTTAATTTTTCACAAATATTACCAGATCCAT CAAAACCAAGCAAGAGGTCATTTATTGAAGATCTACTTTTCAACAAAGTGACACTTGCAG ATGCTGGCTTCATCAAACAATATGGTGATTGCCTTGGTGATATTGCTGCTAGAGACCTCA TTTGTGCACAAAAGTTTAACGGCCTTACTGTTTTGCCACCTTTGCTCACAGATGAAATGA TTGCTCAATACACTTCTGCACTGTTAGCGGGTACAATCACTTCTGGTTGGACCTTTGGTG CAGGTGCTGCATTACAAATACCATTTGCTATGCAAATGGCTTATAGGTTTAATGGTATTG GAGTTACACAGAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGTG CTATTGGCAAAATTCAAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAG ATGTGGTCAACCAAAATGCACAAGCTTTAAACACGCTTGTTAAACAACTTAGCTCCAATT TTGGTGCAATTTCAAGTGTTTTAAATGATATCCTTTCACGTCTTGACAAAGTTGAGGCTG AAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTC AACAATTAATTAGAGCTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGCTACTAAAATGT CAGAGTGTGTACTTGGACAATCAAAAAGAGTTGATTTTTGTGGAAAGGGCTATCATCTTA TGTCCTT SEQ GAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGTG ID CTATTGGCAAAATTCAAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAG NO: 27 ATGTGGTCAACCAAAATGCACAAGCTTTAAACACGCTTGTTAAACAACTTAGCTCCAATT TTGGTGCAATTTCAAGTGTTTTAAATGATATCCTTTCACGTCTTGACAAAGTTGAGGCTG AAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAGTTTGCAGACATATGTGACTC AACAATTAATTAGAGCTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGCTACTAAAATGT CAGAGTGTGTACTTGGACAATCAAAAAGAGTTGATTTTTGTGGAAAGGGCTATCATCTTA TGTCCTTCCCTCAGTCAGCACCTCATGGTGTAGTCTTCTTGCATGTGACTTATGTCCCTG CACAAGAAAAGAACTTCACAACTGCTCCTGCCATTTGTCATGATGGAAAAGCACACTTTC CTCGTGAAGGTGTCTTTGTTTCAAATGGCACACACTGGTTTGTAACACAAAGGAATTTTT ATGAACCACAAATCATTACTACAGACAACACATTTGTGTCTGGTAACTGTGATGTTGTAA TAGGAATTGTCAACAACACAGTTTATGATCCTTTGCAACCTGAATTAGACTCATTCAAGG AGGAGTTAGATAAATATTTTAAGAATCATACATCACCAGATGTTGATTTAGGTGACATCT CTGGCATTAATGCTTCAGTTGTAAACATTCAAAAAGAAATTGACCGCCTCAATGAGGTTG CCAAGAATTTAAATGAATCTCTCATCGATCTCCAAGAACTTGGAAAGTATGAGCAGTATA TAAAATGGCCATGGTACATTTGGCTAGGTTTTATAGCTGGCTTGATTGCCATAGTAATGG TGACAATTATGCTTTGCTGTATGACCAGTTGCTGTAGTTGTCTCAAGGGCTGTTGTTCTT GTGGATCCTGCTGCAAATTTGATGAAGACGACTCTGAGCCAGTGCTCAAAGGAGTCAAAT TACATTACACATAAACGAACTTATGGATTTGTTTATGAGAATCTTCACAATTGGAACTGT AACTTTGAAGCAAGGTGAAATCAAGGATGCTACTCCTTCAGATTTTGTTCGCGCTACTGC AACGATACCGATACAAGCCTCACTCCCTTTCGGATGGCTTATTGTTGGCGTTGCACTTCT TGCTGTTTTTCAGAGCGCTTCCAAAATCATAACCCTCAAAAAGAGATGGCAACTAGCACT CTCCAAGGGTGTTCACTTTGTTTGCAACTTGCTGTTGTTGTTTGTAACAGTTTACTCACA CCTTTTGCTCGTTGCTGCTGGCCTTGAAGCCCCTTTTCTCTATCTTTATGCTTTAGTCTA CTTCTTGCAGAGTATAAACTTTGTAAGAATAATAATGAGGCTTTGGCTTTGCTGGAAATG CCGTTCCAAAAACCCATTACTTTATGATGCCAACTATTTTCTTTGCTGGCATACTAATTG TTACGACTATTGTATACCTTACAATAGTGTAACTTCTTCAATTGTCATTACTTCAGGTGA TGGCACAACAAGTCCTATTTCTGAACATGACTACCAGATTGGTGGTTATACTGAAAAATG GGAATCTGGAGTAAAAGACTGTGTTGTATTACACAGTTACTTCACTTCAGACTATTACCA GCTGTACTCAACTCAATTGAGTACAGACACTGGTGTTGAACATGTTACCTTCTTCATCTA CAATAAAATTGTTGATGAGCCTGAAGAACATGTCCAAATTCACACAATCGACGGTTCATC CGGAGTTGTTAATCCAGTAATGGAACCAATTTATGATGAACCGACGACGACTACTAGCGT GCCTTTGTAAGCACAAGCTGATGAGTACGAACTTATGTACTCATTCGTTTCGGAAGAGAC AGGTACGTTAATAGTTAATAGCGTACTTCTTTTTCTTGCTTTCGTGGTATTCTTGCTAGT TACACTAGCCATCCTTACTGCGCTTCGATTGTGTGCGTACTGCTGCAATATTGTTAACGT GAGTCTTGTAAAACCTTCTTTTTACGTTTACTCTCGTGTTAAAAATCTGAATTCTTCTAG AGTTCCTGATCTTCTGGTCTAAACGAACTAAATATTATATTAGTTTTTCTGTTTGGAACT TTAATTTTAGCCATGGCAGATTCCAACGGTACTATTACCGTTGAAGAGCTTAAAAAGCTC CTTGAACAATGGAACCTAGTAATAGGTTTCCTATTCCTTACATGGATTTGTCTTCTACAA TTTGCCTATGCCAACAGGAATAGGTTTTTGTATATAATTAAGTTAATTTTCCTCTGGCTG TTATGGCCAGTAACTTTAGCTTGTTTTGTGCTTGCTGCTGTTTACAGAATAAATTGGATC ACCGGTGGAATTGCTATCGCAATGGCTTGTCTTGTAGGCTTGATGTGGCTCAGCTACTTC ATTGCTTCTTTCAGACTGTTTGCGCGTACGCGTTCCATGTGGTCATTCAATCCAGAAACT AACATTCTTCTCAACGTGCCACTCCATGGCACTATTCTGACCAGACCGCTTCTAGAAAGT GAACTCGTAATCGGAGCTGTGATCCTTCGTGGACATCTTCGTATTGCTGGACACCATCTA GGACGCTGTGACATCAAGGACCTGCCTAAAGAAATCACTGTTGCTACATCACGAACGCTT TCTTATTACAAATTGGGAGCTTCGCAGCGTGTAGCAGGTGACTCAGGTTTTGCTGCATAC AGTCGCTACAGGATTGGCAACTATAAATTAAACACAGACCATTCCAGTAGCAG SEQ TCCTCTGGCTGTTATGGCCAGTAACTTTAGCTTGTTTTGTGCTTGCTGCT ID GTTTACAGAATAAATTGGATCACCGGTGGAATTGCTATCGCAATGGCTTGTCTTGTAGGC NO: 28 TTGATGTGGCTCAGCTACTTCATTGCTTCTTTCAGACTGTTTGCGCGTACGCGTTCCATG TGGTCATTCAATCCAGAAACTAACATTCTTCTCAACGTGCCACTCCATGGCACTATTCTG ACCAGACCGCTTCTAGAAAGTGAACTCGTAATCGGAGCTGTGATCCTTCGTGGACATCTT CGTATTGCTGGACACCATCTAGGACGCTGTGACATCAAGGACCTGCCTAAAGAAATCACT GTTGCTACATCACGAACGCTTTCTTATTACAAATTGGGAGCTTCGCAGCGTGTAGCAGGT GACTCAGGTTTTGCTGCATACAGTCGCTACAGGATTGGCAACTATAAATTAAACACAGAC CATTCCAGTAGCAGTGACAATATTGCTTTGCTTGTACAGTAAGTGACAACAGATGTTTCA TCTCGTTGACTTTCAGGTTACTATAGCAGAGATATTACTAATTATTATGAGGACTTTTAA AGTTTCCATTTGGAATCTTGATTACATCATAAACCTCATAATTAAAAATTTATCTAAGTC ACTAACTGAGAATAAATATTCTCAATTAGATGAAGAGCAACCAATGGAGATTGATTAAAC GAACATGAAAATTATTCTTTTCTTGGCACTGATAACACTCGCTACTTGTGAGCTTTATCA CTACCAAGAGTGTGTTAGAGGTACAACAGTACTTTTAAAAGAACCTTGCTCTTCTGGAAC ATACGAGGGCAATTCACCATTTCATCCTCTAGCTGATAACAAATTTGCACTGACTTGCTT TAGCACTCAATTTGCTTTTGCTTGTCCTGACGGCGTAAAACACGTCTATCAGTTACGTGC CAGATCAGTTTCACCTAAACTGTTCATCAGACAAGAGGAAGTTCAAGAACTTTACTCTCC AATTTTTCTTATTGTTGCGGCAATAGTGTTTATAACACTTTGCTTCACACTCAAAAGAAA GACAGAATGATTGAACTTTCATTAATTGACTTCTATTTGTGCTTTTTAGCCTTTCTGCTA TTCCTTGTTTTAATTATGCTTATTATCTTTTGGTTCTCACTTGAACTGCAAGATCATAAT GAAACTTGTCACGCCTAAACGAACATGAAATTTCTTGTTTTCTTAGGAATCATCACAACT GTAGCTGCATTTCACCAAGAATGTAGTTTACAGTCATGTACTCAACATCAACCATATGTA GTTGATGACCCGTGTCCTATTCACTTCTATTCTAAATGGTATATTAGAGTAGGAGCTAGA AAATCAGCACCTTTAATTGAATTGTGCGTGGATGAGGCTGGTTCTAAATCACCCATTCAG TACATCGATATCGGTAATTATACAGTTTCCTGTTTACCTTTTACAATTAATTGCCAGGAA CCTAAATTGGGTAGTCTTGTAGTGCGTTGTTCGTTCTATGAAGACTTTTTAGAGTATCAT GACGTTCGTGTTGTTTTAGATTTCATCTAAACGAACAAACTAAAATGTCTGATAATGGAC CCCAAAATCAGCGAAATGCACCCCGCATTACGTTTGGTGGACCCTCAGATTCAACTGGCA GTAACCAGAATGGAGAACGCAGTGGGGCGCGATCAAAACAACGTCGGCCCCAAGGTTTAC CCAATAATACTGCGTCTTGGTTCACCGCTCTCACTCAACATGGCAAGGAAGACCTTAAAT TCCCTCGAGGACAAGGCGTTCCAATTAACACCAATAGCAGTCCAGATGACCAAATTGGCT ACTACCGAAGAGCTACCAGACGAATTCGTGGTGGTGACGGTAAAATGAAAGATCTCAGTC CAAGATGGTATTTCTACTACCTAGGAACTGGGCCAGAAGCTGGACTTCCCTATGGTGCTA ACAAAGACGGCATCATATGGGTTGCAACTGAGGGAGCCTTGAATACACCAAAAGATCACA TTGGCACCCGCAATCCTGCTAACAATGCTGCAATCGTGCTACAACTTCCTCAAGGAACAA CATTGCCAAAAGGCTTCTACGCAGAAGGGAGCAGAGGCGGCAGTCAAGCCTCTTCTCGTT CCTCATCACGTAGTCGCAACAGTTCAAGAAATTCAACTCCAGGCAGCAGTAGGGGAACTT CTCCTGCTAGAATGGCTGGCAATGGCGGTGATGCTGCTCTTGCTTTGCTGCTGCTTGACA GATTGAACCAGCTTGAGAGCAAAATGTCTGGTAAAGGCCAACAACAACAAGGCCAAACTG TCACTAAGAAATCTGCTGCTGAGGCTTCTAAGAAGCCTCGGCAAAAACGTACTGCCACTA AAGCATACAATGTAACACAAGCTTTCGGCAGACGTGGTCCAGAACAAACCCAAGGAAATT TTGGGGACCAGGAACTAATCAGACAAGGAACTGATTACAAACATTGGCCGCAAATTGCAC AATTTGCCCCCAGCGCTTCAGCGTTCTTCGGAATGTCGCGCATTGGCATGGAAGTCACAC CTTCGGGAACGTGGTTGACCTACACAGGTGCCATCAAATTGGATGACAAAGATCCAAATT TCAAAGATCAAGTCATTTTGCTGAATAAGCATATTGACGCATACAAAACATTCCCACCAA CAGAGCCTAAAAAGGACAAAAAGAAGAAGGCTGATGAAACTCAAGCCTTACCGCAGAGAC AGAAGAAACAGCAAACTGTGACTCTTCTTCCTGCTGCAGATTTGGATGATTTCTCCAAAC AATTGCAACAATCCATGAGCAGTGCTGACTCAACTCAGGCCTAAACTCATGCAGACCACA CAAGGCAGATGGGCTATATAAACGTTTTCGCTTTTCCGTTTACGATATATAGTCTACTCT TGTGCAGAATGAATTCTCGTAACTACATAGCACAAGTAGATGTAGTTAACTTTAATCTCA CATAGCAATCTTTAATCAGTGTGTAACATTAGGGAGGACTTGAAAGAGCCACCACATTTT CACCGAGGCCACGCGGAGTACGATCGAGTGTACAGTGAACAATGCTAGGGAGAGCTGCCT ATATGGAAGAGCCCTAATGTGTAAAATTAATTTTAGTAGTGCTATCCCCATGTGATTTTA AT

Example 4 Cloning of Shuttle Plasmids with HCV and HIV Nucleic Acid Targets and Integration into Mycobacterium Smegmatis

Nucleic acid targets for both HCV and HIV were cloned and a construct was prepared in E. coli as described in Example 1, with the nucleic acid targets for HCV and HIV replacing the SARS-CoV-2 targets.

The nucleic acid fragments were derived from publicly available sequences for HIV (NCBI Accession Number NC_001802.1) and HCV (NCBI Accession Number NC_004102.1).

The resulting plasmid was electroporated into M. smegmatis mc²155 by standard laboratory methods, as described in Example 1.

The recombinant M. smegmatis containing the HIV and HCV targets was added to the Xpert® HIV-1 Viral Load and Xpert® HCV Viral Load cartridges. The digital output of the Xpert® HIV-1 Viral Load and Xpert® HCV Viral Load tests was analysed and a positive result was obtained using the recombinant M. smegmatis containing the HIV and HCV genome fragments. 

The invention claimed is:
 1. A nucleotide cassette having the following formula: X₁-X₂-X₃-X₄-X₅ wherein, X₁ is an inducible promoter; X₂ is a nucleotide sequence corresponding to at least one single stranded RNA diagnostic target; X₃ is a nucleotide sequence that encodes an artemin protein from Artemia salina; X₄ is a molecular switch; and X₅ is a nucleotide sequence that encodes a DNAse enzyme and is under control of the molecular switch, wherein the single stranded RNA diagnostic target is a sequence detected by a molecular diagnostic assay.
 2. The nucleotide cassette of claim 1, wherein the single stranded RNA diagnostic target is a SARS-CoV-2 target.
 3. The nucleotide cassette of claim 2, wherein the SARS-CoV-2 target is selected from the group consisting of RdRP1, RdRP2, N protein, E protein, Spike protein, ORF1ab, and combinations thereof.
 4. The nucleotide cassette of claim 1, wherein X₂ is selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, and combinations thereof.
 5. The nucleotide cassette of claim 1 to wherein the inducible promoter is Hsp60.
 6. The nucleotide cassette of claim 1, wherein the molecular switch is a theophylline riboswitch and wherein the DNAse enzyme is produced in the presence of theophylline.
 7. The nucleotide cassette of claim 1, comprising the sequence of SEQ ID NO:14 or SEQ ID NO:15.
 8. An RNA expression product produced by the nucleotide cassette of claim 1 or having the nucleotide sequence of SEQ ID NO:16 or SEQ ID NO:17.
 9. A vector comprising the nucleotide cassette of any one of claim
 1. 10. A cell comprising the nucleotide cassette of claim
 1. 11. The cell of claim 10, wherein the host cell is a recombinant Mycobacterium smegmatis cell.
 12. A diagnostic control composition comprising a non-pathogenic recombinant bacterium having a modified genetic content comprising a nucleotide cassette having the following formula: X₁-X₂-X₃-X₄-X₅ wherein, X₁ is an inducible promoter; X₂ is a nucleotide sequence corresponding to at least one single stranded RNA diagnostic target; X₃ is a nucleotide sequence that encodes an artemin protein from Artemia salina; X₄ is a molecular switch; and X₅ is a nucleotide sequence that encodes a DNAse enzyme and is under control of the molecular switch, wherein the single stranded RNA diagnostic target is a sequence detected by a molecular diagnostic assay and wherein the diagnostic control composition mimics the diagnostic profile of a clinical sample in the molecular diagnostic assay.
 13. The diagnostic control composition of claim 12, wherein the single stranded RNA diagnostic target is a SARS-CoV-2 target.
 14. The diagnostic control composition of claim 13, wherein the SARS-CoV-2 target is selected from the group consisting of RdRP1, RdRP2, N protein, E protein, Spike protein, ORF1ab, and combinations thereof.
 15. The diagnostic control composition of claim 12, wherein X₂ is selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, and combinations thereof.
 16. The diagnostic control composition of claim 12, wherein the inducible promoter is Hsp60.
 17. The diagnostic control composition of claim 12, wherein the molecular switch is a theophylline riboswitch and wherein the DNAse enzyme is produced in the presence of theophylline.
 18. The diagnostic control composition of claim 12, wherein the nucleotide cassette comprises the sequence of SEQ ID NO:14 or SEQ ID NO:15.
 19. The diagnostic control composition of claim 12, wherein the non-pathogenic recombinant bacterium is Mycobacterium smegmatis.
 20. A method of producing a recombinant bacterium that mimics the diagnostic profile of a single stranded RNA of interest in a molecular diagnostic assay, the method comprising: transforming a non-pathogenic bacterium with a vector comprising the nucleotide cassette of claim 1 and a selection marker to obtain a recombinant bacterium; and (ii) culturing the recombinant bacterium obtained in step (i) under selective conditions in order to select the recombinant bacterium.
 21. The method of claim 20, wherein the non-pathogenic bacterium is Mycobacterium smegmatis.
 22. The method of claim 20, wherein the recombinant bacterium mimics the diagnostic profile of SARS-CoV-2.
 23. The method of claim 20, wherein the selection marker is an antibiotic selection marker.
 24. The method of claim 20, wherein the recombinant bacterium is either stably or transiently transformed with the vector.
 25. A kit comprising the cell of claim 10 and instructions for use.
 26. A kit comprising the diagnostic control composition of claim 12, and instructions for use. 